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RECOMBINANT ADENOVIRUS AND ADENO-ASSOCIATED VIRUS, 
CELL LINES, AND METHODS OF PRODUCTION AND USE THEREOF 

This invention was supported by the National 
Institute of Health Grant Nos. HD32649-01, DK47757 and 
5 DK49136. The United States government has rights in this 
invention. 

Field of the Invention 

The present invention relates generally to the field 
of somatic gene therapy, and specifically to methods and 
10 compositions useful in the treatment of genetic 
disorders • 

Background of the Invention 

Adenoviruses are eukaryotic DNA viruses that can be 
modified to efficiently deliver a therapeutic or reporter 

15 transgene to a variety of cell types [see, e.g., M. S. 
Horwitz et al, "Adenoviridae and Their Replication", 
Virology , second edition, pp. 1712, ed. B. N. Fields et 
al, Raven Press Ltd., New York (1990)]. Recombinant 
adenoviruses (rAds) are capable of providing extremely 

20 high levels of transgene delivery to virtually all cell 
types, regardless of the mitotic state. The efficacy of 
this system in delivering a therapeutic transgene in vivo 
that complements a genetic imbalance has been 
demonstrated in animal models of various disorders [K. F. 

25 Kozarsky et al, Somatic Cell Mol. Genet. . 19:449-458 
(1993) ("Kozarsky et al I"); K. F. Kozarsky et al, 
Biol. Chem. , 269:13695-13702 (1994) ("Kozarsky et al II) 
and others]. The use of recombinant adenoviruses in the 
transduction of genes into hepatocytes in vivo has 

30 previously been demonstrated in rodents and rabbits [see, 
e.g., Kozarsky II, cited above, and S. Ishibashi et al, 
J. Clin. Invest. , 92:883-893 (1993)]. 

The first-generation recombinant, replication- 
deficient adenoviruses which have been developed for gene 

35 therapy contain deletions of the entire Ela and part of 
the Elb regions. This replication-defective virus is 
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grom on an adenovirus -transformed, complementation human 
embryonic kidney cell line containing a functional 
adenovirus Ela gene ?*hieh provide® a transacting Ela 
protein, the 293 cell [ATCC CKL1573]o El~deleted viruses 
5 are capable of replicating and producing infectious virus 
in the 293 cells, ^hich provide Ela and Sib region gene 
products in £r The resulting virus is capable of 
infecting many cell types and can express the introduced 
gene (providing it carries its own promoter) , but cannot 
10 replicate in a cell that does not carry the El region DMA 
unless the cell is infected at a very high multiplicity 
of infection o 

Adeno-associated virus (AAV) is an integrating human 
DMA parvovirus ^yhich has been proposed for use as a gene 

15 delivery vehicle for somatic gene therapy [Bo Jo Carter, 
in 00 Handbook of Parvoviruses 03 , edo, Po Tijsser, CRC 
Press, pp 0 155-168 (1990) ]• This small non-enveloped 
virus contains a 4o5 kb single stranded (ss> DMA genome 
that encodes sets of regulatory and ©apsid genes called 

20 rep and c&p 0 Rep polypeptides (r©p78, rep68, rep62 and 
rep40) are involved in replication, rescue and 
integration of the AAV genome • The amp proteins (VP1, 
VP2 and VP3) form the virion caps id o Flanking the rep 
and cap open reading frames at the 5° and 3° ends are 145 

25 bp inverted terminal repeats (XXRs) , the first 125 bp of 
^hich are capable of forming or T-shaped duplex 
structures o 

Recombinant forms of AAV (rAAV) have been developed 
as vectors by replacing all viral open reading frames 

30 with a therapeutic minigene, uhile retaining the 

necessary eis elements contained in the ITRSo [See, 
eogo, US Patent los 0 4,797,368? 5,153,414? 5,139,941? 
5,252,479? and 5,354,678? and International Publication 
EJoSo WO 91/18088 published November 28, 1991? WO 93/24641 

35 published December 9, 1993 and W094/13788 published June 
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23, 1994]. However, progress towards establishing AAV as 
a transducing vehicle for gene therapy has been slow for 
a variety of reasons. For example, the integrated 
provirus preferentially targets specific sites in 
5 chromosome 19. Additionally, difficulties surround large- 
scale production of replication defective recombinants. 
The cells employed to produce rAAV must also be infected 
with adenovirus or herpesvirus to provide the necessary 
helper functions, thereby producing problems in purifying 

10 recombinant AAV (rAAV) from contaminating virus in 

culture. Practical experience with purified recombinant 
AAV as a gene therapy vector has been disappointing, 
because the more purified the AAV is from co-infection 
with its helper virus in culture, the lower the gene 

15 transduction efficiencies that the rAAV displays. 

There remains a need in the art for additional 
recombinant adenoviruses and rAAV, therapeutic 
compositions and methods which enable effective use of 
these recombinant viruses in the treatment of disorders 

20 and diseases by gene therapy. 
Summary of the Invention 

In one aspect of this invention, a packaging cell 
line is provided which expresses adenovirus genes Ela, 
Elb and E4, or functional fragments thereof, e.g., the E4 

25 open reading frame (ORF) 6. 

In another aspect, the invention provides a rAd 
comprising the DNA of at least a portion of the genome of 
an adenovirus having functional deletions of the El and 
E4 gene regions; a suitable gene operatively linked to 

30 regulatory sequences directing its expression, and an 
adenovirus capsid, the rAd capable of infecting a 
mammalian cell and expressing the gene product in the 
cell in vivo or in vitro. The invention also provides a 
mammalian cell infected with the rAd described above. 
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In still another aspect, the invention provides a 
rAd shuttle vector comprising the DMA of at least a 
portion of the genome of an adenovirus having functional 
deletions of the SI and S4 gene regions • 
5 In a further aspect, the invention provides a method 

for producing the above-described recombinant Ad and a 
method for delivering a selected gene into a mammalian 
cell using the recombinant Ad described above o 

In another aspect, the invention provides a method 

10 for enhancing the efficiency of transduction of a 

recombinant AAV into a target cello The method operates, 
in brief, by infecting a target cell with a ss 
recombinant adeno~associated virus (rAAV) which comprises 
a transgene ©peratively linked t® regulatory sequences 

IS directing its expression, and contacting the infected 
cells with an agent which facilitates the conversion of 
ss rAAV to its double stranded (ds> formo Conversion of 
ss rAAV to ds rAAV occurs in the target cell, resulting 
in enhanced transduction ©f the rAAV into the target 

20 cello The agent may be a helper virus which carries a 
^ selected gene or functional fragment thereof encoding a 
polypeptide capable ©f enhancing the conversion of the ss 
• rAAV to ds rAAV and which is ©o~inf ected into the same 
target cello The agent may also be a drug or chemical 

25 composition which accomplishes the same function and is 
applied to the infected target cello This method can 
operate both in an esr vivo setting and in vivo* 

In yet another aspect, the invention provides a 
novel recombinant AAV, which contains both the transgene 

30 intended for use in treating a genetic disease or 

disorder and at least one additional gene operatively 
linked to inducible or constitutive regulatory sequences » 
The additional gene(s) encodes a polypeptide capable of 
facilitating, alone or in concert with other additional 
" 35 genes, the conversion of ss rAAV to its ds form upon 
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expression o In a preferred embodiment, the additional 
gene is adenovirus or a functional fragment thereof <> 
Also disclosed is a method for enhancing the efficiency 
of transduction of the novel rAAV into a target cello 
5 The novel rAAV and methods of this invention are 

also useful in pharmaceutical compositions for use in ex 
vivo and in vivo gene therapy treatment protocols for 
treating inherited diseases, cancer, and other genetic 
dysfunction© o 

10 Other aspects and advantages of the present 

invention are described further in the following detailed 
description of the preferred embodiments thereof o 
feyfteff pereyftptfroq of the jO^^ofE 

Fig c 1 is a schematic drawing of an exemplary 

15 plasmid pWWB4@RF<5 [SEQ 2D KfOs 1] or pMTE40RF6, which 
contains an KE2W or sheep EST promoter, respectively, in 
control of a human E4 ORF 6 gene sequence, a growth 
hormone gene terminator sequence (GH) , an SV&Q ori, 
pBR322~based plasmid sequences including a neo 11 gene, an 

20 SV40 polyA site and an amp a gene* 

Fig 0 2 is a schematic map of rAd H5o001CBLac£ [SEQ 
ID EJOi 3} with indicated restriction endonuclease enzyme 
sites o The striated bar represents the CBLacZ minigene; 
the black bar represents Ad5 viral backbone, the 

25 crosshatched bar represents Ad E4 deletion o 

Fig* 3 plots Lmc% forming units (I*FU)/ml vs time 
(hours) for E4 complementing cell lines infected with 
M5o001CBL&c£o 

Fig e 4A is a graph of the induction, ORFS expression 
30 and viral production in 293-27=18 packaging cells 

plotting yield at 24 hours post-infection (pi) in LFU/ml 
and ORFS protein (abs 0 mm) vs« concentration of the 
inducer, dexamethasone (jaKI) « AbSomm is the intensity of 
the sise of the protein band on a Western blot and 
35 reflects absorbance and protein size in mm 3 * The square 
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is yield at 24 hours pi. The diamond is ORF6 protein 
detected at 24 hours pi* 

Fig. 4B is a similar graph to that of Fig. 4A, 
except that the packaging cells are 293-10-3 cells. The 
5 symbols are as described for Fig. 4A. 

Fig. 5A is a bar graph plotting B-galactosidase 
enzyme activity in lysates from infected Hela cells. The 
horizontal axis indicates the adenoviruses infected into 
the HeLa cells, with the symbol N +" indicating the 

10 addition of the adenovirus to the rAAV, AV.CMV.LacZ. The 
vertical axis indicates intracellular fl-galactosidase 
specific activity (mUnits/mg protein) using ONPG. Below 
each bar, the fold-induction in specific activity 
relative to cells that received the AV.CKVLacZ vector 

15 alone is given. 

Fig. 5B is a bar graph plotting Ad multiplicity of 
infection (MOI) in HeLa cells of wild-type Ad5 or the E2 
mutant dl802, the cells co- infected with rAAV vs. 
intracellular B-galactosidase specific activity. See 

20 Example 11. 

Fig. 6A is a graph in which B-galactosidase specific 
activity and counts per minute (CPM) are plotted along 
the vertical axis and adenovirus MOI's are on the 
horizontal axis for HeLa cells infected with wtAd5 and 

25 rAAV according to Example 12. Data obtained from low HOI 
(1, 5, and 10) infections are shown. 

Fig. 6B is a graph similar to that of Fig. 6A except 
that the cells were infected with Ad mutant dl802. 
Fig. 7A illustrates a model for leading strand 

30 synthesis of a complementary AAV strand in the presence 
of Rep (+Rep) or absence of Rep (-Rep) • Rep expresses a 
terminal resolution activity that can convert a duplex 
structure with closed-ends to an open-ended duplex. In 
the absence of Rep, terminal resolution is impaired 

35 leaving the covalently closed, hairpin structures intact. 
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Under these conditions, hairpins are expected to be found 
leftward and rightvard, since both strands of a rescued 
ds AAV genome are packaged into virions. 

Fig. 7B is a schematic of linear AV.GKVLacZ with 
5 labeled domains including the AAV ITRs, CMV immediate 
early enhancer /promoter (CMV) , SV40 splice donor-splice 
acceptor (SD/SA) , E. coli 6-galactosidase cDNA (LacZ) , 
and SV40 polyA signal (pA) . Two NotI sites located at bp 
positions 1035 and 4509 are indicated. 

10 Fig. 7C illustrates a closed end and an open end 

fragment of rAV.CMVLacZ. 

Figs. 7D, 7E and 7F indicate the mixture of 
open-ended and covalently closed duplex fragments 
generated by NotI digestion of ss AV.CMVLacZ at position 

15 4509 in the absence of terminal resolution. The NotI 

4509 digestion provides a convenient means of releasing a 
361 bp fragment that contains the right ITR in the 
context of a hybridization target (i.e. SV40 pA) . In the 
presence of terminal resolution, only the open-ended 361 

20 bp fragment would be expected to be generated (Fig. 7D) 
by such digestion. 

Fig. 8A is a bar graph plotting 6-galactosidase 
specific activity (mUnits/mg protein) vs. increasing 
concentration of zinc (fM) inducer for cell line 293 (MT- 

25 0RF6) transduced with AVCMVLacZ (first row below each 
bar) • Also provided is the fold-induction relative to 
293 cells (second row below each bar) , and the 
fold- induction relative to 293(ORF6) cells maintained in 
the absence of zinc (third row) • 

30 Fig. 8B is a bar graph plotting CPM of duplex 

monomer replicative form (RFm) of rAAV vs. the 
concentration of zinc (MM) used for induction and the 
fold-induction relative to 293(ORF6) cells maintained in 
0 mM zinc below each bar. 
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Fig<> 8C is a graphical comparison of the induction 
profiles that describe AVoOWiecS transduction 
efficiency o Specific activity data from Fig<> 8& and CPM 
data of AVoGBZV£@c£ KFm from Figo 8B are plotted along the 
5 vertical axis, and concentration of zinc sulfate used 

during -the experiment is shown along the horizontal axis** 

Fig* 9 is a bar graph plotting specific activity 
(milliunits B-galactosidase/^g protein) vs the 
concentration ©f sine used for induction (first row under 

10 the horizontal assis) , the fold-induction relative to HeLa 
cells (second row) , and the fold-induction relative to 
SJeLa(E3t & ORFS) cells Maintained in the absence of zinc 
(third row) , for the KleLa (E2T-03&F® ) cells transduced at an 
£301 of 2,000 S&oCERZLmcB virus particles /ceil in the 

15 absence of zinc sulfate inducer or in the presence of SO, 
100, 150, 200 or 250 zinc sulfate inducer o 

Fig 0 10 is a schematic of the plasmid p&VoSBSV£ac# 
[SSQ ID EJOs 4]o 

Fig 0 11 illustrates plasmid phV 0 CMVALP <, GRE-0RF6 [SEQ 

20 ID WOs 5]o 

Detailed Descriiotion of the Invention 

The present invention provides packaging cell lines, 
■ which enable the production of recombinant adenoviruses 
(rM) functionally deleted in both the SI and S4 genes « 

25 These r&d and methods which enable the therapeutic 
treatment of disorders with such r&ds are disclosed • 
Novel 00 second generation" recombinant adeno~associated 
virus (r&&V) and Methods for enhancing the transduction 
efficiency of rM? containing a transgene for expression 

30 in a somatic gene therapy protocol are also provided « 

The methods and compositions of this- invention are useful 
in ex vivo applications of gene therapy, such as in the 
transduction of bone marrow cells with desirable 
hematopoietic stem cell progenitor genes prior to bone 

35 marrow transplantation • The embodiments of the invention 
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are also useful in pharmaceutical compositions for direct 
in vivo treatment of patients by gene therapy vectors, 
including the transduction of desirable genes in patients 
with genetic disorders , such as cystic fibrosis. 
5 J. Packaging Cell Lines 

To increase the transgene capacity and decrease 
immune response of rAds, as many viral genes as possible 
should be deleted to inactivate the adenovirus. However, 
it is crucial to generate complementing cell lines for 

10 construction and propagation of such deleted Ad. The 

method and compositions of the present invention overcome 
several problems previously identified in the gene 
therapy for first generation El deleted adenoviruses and 
display advantages in administration particularly to 

15 muscle tissue. 

Early region 4 (E4) of Ad serotype 5 consists of 7 
ORFs believed to be involved in viral DNA replication, 
host cell shut-off, and late mRNA accumulation. To 
generate rAd deleted in E4, the function of the E4 region 

20 must be supplied to the rAd by a helper virus or 

packaging cell line. However, useful packaging cell 
lines have not been available previously because normally 
the continuous expression of functioning Ad El and 
functional E4 in a single cell line are toxic to the 

25 cell. Such cells are therefore not useful for the growth 
and replication of rAds. Further, the DNA encoding the 
functional Ad El and Ad E4 genes, when present in a 
packaging cell line, can increase the chances of 
recombination with a rAd virus to cause the virus to 

30 revert to a wildtype Ad virus. * 

The present invention avoids these problems by 
providing a packaging cell line which contains the Ad5 El 
gene and only the ORF 6 of the Ad5 E4 gene. ORF6 of E4 
alone can provide the requirements for E4 in the viral 

35 life cycle. 
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According to this invention, the ORFS is preferably 
under the transcriptional control of an inducible 
promoter o The souse ©salary tu^or virus (WW) promoter, 
inducible by a glucocorticoid, particularly, • 
5 dexamethasone, is presently preferred o The DNA sequence 
of the K3E3TV promoter spans nucleotides 1-15 OS of SEQ ID 
BJOs lo Another inducible promoter is the sheep 
metallothionine (EST) promoter, inducible by sine [E3o Go ■ 
Peterson et al, Bur P Jo ^loche^n . A2&S417-424 (1988) ]<> 

10 However, the sine sulfate inducer ©f the S3T promoter can 
itself be toxic to the cells o Other inducible promoters, 
such as those identified in International patent 
application 15J095/13392, published E2ay 18, 1995, and 
incorporated by reference herein may also be used in the 

15 production of packaging cell lines according to this - 
invention* Constitutive pr©M®ters, such as the 
constitutive Ad5 S4 region promoter, LTR, may be employed 
in control of the expression of ORFSo 

The packaging cell line ©f the invention ^hich 

20 utilises an inducible promoter permits one to control the 
development of toxicity by regulating the expression of 
the 3S4 ORFS gene e After the desired shuttle vector 
containing the Ad sequences is transf ected into the cell 
line, expression of the 324 ©RFS can be induced by the 

25 appropriate inducer <> The packaging cell is thus able to 
provide both Ad SI and Ad 3S4 ORFS gene products to the 
rAd for a sufficient period to allou productive infection 
and recovery of the rAd, before the cell becomes toxic o 
At present, the time period before the cell experiences 

30 toxicity is about 10 dayso 

In its most preferred form, the packaging cell line 
is a human embryonic kidney (B3SK) 293 El expressing cell 
line into ^hich is introduced the E4 ORF S sequence under 
the control of the inducible promoter «, It should be 

35 understood by one of skill in the art that another parent 
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cell line may be selected for the generation of a novel 
cell line expressing the Ela, Sib, and E4 ORF 6 genes of a 
selected adenovirus serotype o Among such parent cell 
line© say be included HeLa [CCL 2], A549 [CCL 185], KB 
5 [CCL 17], Detroit [e 0 go, Detroit 510, CCL 72] and WI-38 
[ATCC CCL 75] cells o These cell lines are all available 
from the American Type Culture Collection, 12301 Parklawn 
Drive, Roekville, E3D, BSAo Other suitable parent cell 
lines say be obtained fro® other sources <, If such parent 

10 cell line® were ©elected for Modification, the cell line 
would need to be further supplied with the Ela and Elb 
gene functions, e 0 g 0 , such as by transfection with a 
plasaid containing these genes or functional fragments 
thereof under a suitable promoter, as well as with the 

15 :©KF6 gene as described herein 0 

Example 1 teaches construction of packaging cell 
lines containing only the ©KF S of Ad5 S4 region or, for 
functional comparisons, the entire E4 region o Briefly 
described, the entire 24 region and an ORF6 sequence of 

20 Ad 5 S4 gene were obtained by known techniques [see, 

e 0 g 0 , ssmabrook et alo, Molecular Cloning • A Laboratory 
Manual o ra , 2d edito, Cold Spring Harbor Laboratory, New 
York (1989) and references cited therein] * To isolate 
the ORF6 region, the anchored 3PCR technique was used to 

25 amplify the ©KF6 sequence from its initiation codon to 
its termination codon o PriMers selected from the 
published sequence of ORF6 were used to amplify the ORF 
sequence and insert restriction sites onto the end of the 
sequence o The E4 sequence itself is reproduced as 

30 nucleotides 1523 through 2408 of ®%® S® £3®s 1° The 
entire 34 gene sequence is published in the Genbank 
sequence of Ad5 [Genbank Accession Woo £373260] o 

A minigene was constructed that placed the ORF 6 
sequence under the control of a selected promoter <> By 

35 ^minigene 05 as used here is meant the combination of the 
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desired sequence to be esspressed (in this particular 
instance, the OSF6 sequence) and tine other regulatory 
elements necessary to transcribe the desired sequence and 
express the gene product in a cell containing that 
5 minigene o The ORFS sequence gene is ©peratively linked 
to regulatory components in a manner ^hich permits its 
transcription o Such components include conventional 
regulatory elements, such as a promoter to drive ORF6 
egression o One inducible promoter the Zn* 2 
10 inducible KIT promoter? the other ^as the dexamethasone- 
inducible promoter" of SEQ IB> KfOs l D 

The minigene also contains nucleic acid sequences • 
heterologous to the ORFS viral sequence, including 
sequences providing signals required for efficient 
15 polyadenylation of the transcript (poly~A or pA) . A 
common poly~A sequence ^hich is employed in this 
invention is that derived from the growth hormone (GH) 
gene terminator sequence (nuco of SEQ ID EJOs' 

1) o The poly~A sequence generally is inserted in the 
20 minigene following the sequence . The polyA sequence 

employed in the minigene described in Example 1 

[SEQ ID MOz 1] is supplied by the gb gene terminator and 
• an origin of replication (ori) . A similar minigene . 

differing in promoter sequence, polyA sequence and/or 
25 SV40 ori can also be designed by one of skill in the art 
to transfer the E4 OHFS sequence to a shuttle plasmido 
Selection of these and other common vector elements are 
conventional [see, eog 0 „ Sambrook et al, cited above, and 
references cited therein] and many such sequences are 
30 available from commercial and industrial sources as well 
as from- Genbanko 

The ORFS-containing minigene &as subcloned into a 
pBR322=based shuttle plasmid that contained a neomycin 
resistance gene, resulting in the shuttle vector of Fig 0 
35 lo Any of the many knoun bacterial shuttle vectors may 
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be employed to carry the minigene, providing that the 
vector contain© a reporter gene or ©electable marker of 
which many, ©og 0 „ neo, amp or puriMycin, are toora in the 
arte It is expected that one of skill in the art can 
5 develop other suitable shuttle vectors using other 
plasmid components which are similarly capable of 
transferring the OEFS minigene into the chromosome of a 
cell transfected with the plasaido 

As further described in Example 1, other shuttle 

10 vectors were designed for comparative purposes, which 
contain the complete or substantially complete h&5 E4 
region under the control of the constitutive retroviral 
MLV LTR sequence in the presence or absence of the 
endogenous E4 promoter o The shuttle plasmid carrying the 

15 ORF6 minigene (or the entire E4 region) was introduced 

into HEK 293 cells which express the Ad SI gene products o ■ 
Complementing cell lines were generated that express 
these Ad E4 or ORFS genes from either their endogenous 
promoters or heterologous inducible promoters <> These 

20 cell lines are further characterized by their genetic 
constitution, E4 protein ^synthesis 9 recombinant 
helper function, relative plaque efficiency of H5dll004 
virus 9 and growth kinetics of recombinant E1/E4 deleted 
adenovirus o These characteristics of exemplary 81/24 

25 expressing packaging cell lines are discussed in detail 
in the following examples * 

II o Recombinant Adenovirus 
The E1/E4 expressing cell line is useful in 
constructing E1/E4 deleted r&ds which can deliver a 

30 suitable gene to mammalian cells and tissues o These rAd 
are functionally deleted in at least the 21a, Sib and E4 
Ad gene regions <> By the term ra functionally deleted* 3 is 
meant that a sufficient amount of the gene region is 
removed or otherwise damaged, e 0 g 9cr by mutation or 

35 modification, so that the gene region is no longer 
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capable of producing the products of gene egression . " If 
desired, the entire gene region may be removed « In in 
vivo experiments with the srM grown in the packaging cell 
lines , the E1/E4 deleted rAd demonstrated utility 
5 particularly in transferring a transgene to a muscle 
cello 

The adenovirus sequences used in the construction of 
the shuttle vectors, helper viruses, if needed, and r&d, 
and other components and sequences employed in the 

10 construction of the vectors and viruses described herein 
may be readily obtained from commercial or academic 
sources based on previously published and described 
sequences o Viral materials ^say also be obtained from an 
individual" patient 0 The viral sequences and vector 

15 components May be generated by resort to the teachings 
and references contained herein, coupled with standard 
recombinant molecular cloning techniques known and 
practiced by those skilled in the arte Modifications of 
existing nucleic acid sequences forming the vectors, 

20 including sequence deletions, insertions, and other 

mutations taught by this specification may be generated 
using standard techniques o Similarly, the methods 
employed for the selection of viral sequences useful in a 
vector, the cloning and construction of the ^minigene 03 

25 and its insertion into a desired viral shuttle vector and 
the production of a recombinant infectious virus are 
within the skill in the art given the teachings provided 
herein • 

Construction of the Transgene 
30 A ^minigene** in this context is defined as 

above, except that the components of this minigene are 
designed to express the gene product ex vivo or in vivoo 
Such components include conventional regulatory elements 
necessary to drive expression of the transgene in a cell 
35 transfected with the r&d„ For this minigene, a selected 
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promoter ie operatively linked to the transgene and 
located , with other regulatory element© , within the 
selected viral sequence© of the recombinant vector. 
Selection of the promoter is a routine matter and is not 
5 a limitation of this invention . Useful promoters may be 
constitutive promoters or regulated (inducible) 
promoters, which Hill enable control of the amount of the 
transgene to be eaepressed. For example, a desirable 
promoter is that of the cytomegalovirus (GB3V) immediate 

10 early promoter/ enhancer [see, e.g., Boshart et al, 

jL&s 521-530 (1985) ]o Mother desirable promoter includes 
the Rous sarcoma virus IsTR promoter/ enhancer » Still 
another promoter/ enhancer sequence is the chicken 
cytoplasmic &~actin (CB) promoter [To A. Kost et al, 

15 Wucl o Acids Res . „ JJL(23)s8287 (3.983)]. Other suitable 
promoters may be selected by one of skill in the arto 

The minigene say also desirably contain nucleic 
acid sequences heterologous to the viral vector sequences 
including poly-A sequences and introns with functional 

20 splice donor and acceptor sites, as described above o The 
poly-A sequence generally is inserted in the minigene 
following the transgene sequences and before the 3° 
adenovirus sequences o A minigene of the present 
invention may also contain an intron, desirably located 

25 between the promoter/enhancer sequence and the transgene . 
Selection of these and other common vector elements are 
conventional as described above and many such sequences 
are available from commercial and industrial sources as 
Hell as from Genbank. 

30 As above stated, the minigene is located in the 

site of any selected deletion in the rAd. In the E1/E4 
deleted rAd H5o001CBL©c#, the transgene is located in the 
deleted El gene region. However, the transgene may be 
located elsewhere in the adenovirus sequence, as desired. 
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B. Production of Recombinant Adenovirus 
Adenovirus sequences useful in this invention 
may include the DNA sequences of a number of adenovirus 
types , which are available from Genbank, including type 
5 Ad5 [Genbank Accession No. M73260]. The adenovirus 
sequences may be obtained from any known adenovirus 
serotype, such as serotypes 2, 3, 4, 7 , 12 and 40, and 
further including any of the presently identified 41 
human types [see, e.g., Horwitz, cited above]. Similarly 

10 adenoviruses known to infect other animals may also be 

employed in the vector constructs of this invention. The 
selection of the adenovirus type is not anticipated to 
limit the following invention. A variety of adenovirus 
strains are available from the American Type Culture 

15 Collection, Rockville, Maryland, or available by request 
from a variety of commercial and institutional sources. 
In the following exemplary embodiment an adenovirus, type 
5 (Ad5) is used for convenience. 

However, it is desirable to obtain a variety of 

20 adenovirus shuttle vectors based on different human 

adenovirus serotypes. It is anticipated that a library 
of such plasmids and the resulting rAds would be useful 
in a therapeutic regimen to evade cellular, and possibly 
humoral, immunity, and lengthen the duration of transgene 

25 expression, as well as improve the success of repeat 

therapeutic treatments. Additionally the use of various 
serotypes is believed to produce rAd with different 
tissue targeting specificities. Additionally, the 
absence of adenoviral genes El and E4 in the rAd of this 

30 invention should reduce or eliminate adverse CTL 
responses which normally cause destruction of rAds 
deleted of only the El gene. 

rAds of this invention are recombinant, 
defective adenoviruses (i.e., El deleted) which are also 

35 deleted completely or functionally of the E4 gene region. 
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Functional deletions of E4 gene regions may be assessed 
by assays of Examples 2 and 3, among other assays » rAds 
useful in this invention ©ay optionally bear other 
mutations, e<>go, temperature sensitive (ts) mutations in 
5 the E2a gene region, and deletions in the S3 gene 
regions o 

An adenovirus of this invention contains a 
functional deletion of the adenoviral early immediate 
early gene Ela (which spans mu 3U3 to 4o5) and delayed 

10 early gene Elb (which spans mu 4US to U.2). Similarly 
the adenovirus has a functional deletion of the whole E4 
region (which spans ©u 92 to 97 o 2), or of at least ORFS 
of the E4 region o Gene region© which may be optionally 
deleted in the E1/E4 deleted rAd of this invention 

15 include all or a portion of the adenovirus delayed early 
gene E3 (which spans mu 76 oS to 8So2)o The function of 
S3 is irrelevant to the function and production of the 
rAdo 

The rAd of this invention may also have a 

20 mutation which results in reduced expression ©f 

adenoviral protein and/or reduced viral replication • For 
example, a ts mutation may be introduced into the 
adenovirus delayed early gene E2a (which spans mu 67 . 9 to 
61o5)o Among such mutations include the incorporation of 

25 the missense ts mutation in the (DBF) E2a region found in 
the Ad5 H5tsl25 strain [Po Vander Vliet et al, Jo Virol <, „ 
2J5S348-354 (1975)] at 62*5 mu* A single amino acid 
substitution (62 o 5 mu) at the car bossy end of the 72 led 
protein produced from the E2a gene in this strain 

30 produces a protein product which is a ss DNA binding 

protein and is involved in the replication of adenoviral 
genomic DHAo At permissive temperatures (approximately 
32 °C) the ts strain is capable of full life cycle growth 
on HeLa cells, while at non°permissive temperatures 

35 (approximately 38 ®C) no replication of adenoviral DMA is 
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seen. In addition, at non-permissive temperatures, 
decreased iiimunoreactive 72 kd protein is seen in HeLa 
cells. See, e.g., J. Fo Engelhardt et al, Him 0 Gene 
acher^- , Jisl217-1229 (1994)? Jo Fo Engelhardt et al, grog, 
5 Hatl. Acad. Sci., PSA . 21*5196-6200 (1994) and 

International patent application W095/13392, published 
May 18, 1995, incorporated by reference herein • 

IHIowever, it should be understood that other 
deletions in the adenovirus genome as previously 

10 described in the art or otherwise May also occur in the 
rAds of this invention • One Minimal type of rAd can 
contain adenovirus genomic sequences fro® which all viral 
genes are deleted 0 Klore specifically, the adenovirus 
sequences May be only the cis-acting 5 0 and 3 0 inverted 

15 terminal repeat (ITS) sequences of an adenovirus (which 
function as oris) and the native 5° packaging/ enhancer 
domain, that contains sequences necessary for packaging 
linear Ad genomes and enhancer elements for the 81 
promoter . The adenovirus 5° sequence containing the 5° 

20 ITR and packaging/enhancer region (Ad5 mu 0-1 or bp 1= 

3 SO) can be employed as the 5° adenovirus sequence in rAd 
of this invention o The 3 0 adenovirus sequences including 
the right terminal (3°) ITR sequence of the adenoviral 
genome spanning about bp 35,353 — end of the adenovirus 

25 genome, or map units "98 « 4=100 may be desirably employed 
as the 3° sequence of the rAdo These sequences, which 
are clearly devoid of the El and S4 genes, can flank, or 
be operatively associated with the minigene in a rAdo 
Any other necessary Ad gene products will then be 

30 supplied by helper viruses and the S1/S4 0RF6 expressing 
packaging cell of this invention o 

Exemplary rAd for use in this invention, for 
example, ©ay be obtained by homologous recombination of 
desired fragments from various rAd, a technique which has 

35 been commonly employed to generate other rAd for gene 
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-therapy use,, Ira the examples holow ff a representative 
rAd, ESoOOlGBX&cZt is constructed by homologous 
recombination between the adenovirus dl!004 (also 
385dlX004) viral backbone and pRt&GBL&cZ minigene DBJAo 
5 185dllOQ4 is an Ad5 virus deleted of from about Map unit 
92 ol through Map unit 98 , ± 0 ©, substantially the entire 
S4 gene 0 The dl!004 virus is described in Bridge and 
Ketner, Jo Virol o „ S32 f21 §531-638 (Febo 1989) • 

The pK&QBL®c% vector is a cDNA plasmid 

10 containing Ad m 0 Uo 0=1^ an SI deletion into ^rhich is 
inserted a bacterial ]S~galact@sidase gene under the 
control of a chicken J8~aetin promoter , with other 
regulatory elements a© described below, and flanked by Ad 
MoUo 9-16 and plasmid sequence* 

15 The production of the E1/B4 rAd of this 

invention in the packaging cell line of this invention 
utilises conventional techniques o Such techniques 
include conventional cloning techniques of cDESA such as 
those described in tessts [Sambrook et al, cited above] , 

20 use of overlapping oligonucleotide sequences of the 
adenovirus genomes , PGR and any suitable method which 
provides the desired nucleotide sequence • Standard 
transfection and co-transfection techniques are employed, 
e 0 g 0 p GaP0 4 transfection techniques using the 

25 complementation 293 cell line 0 Other conventional 

methods employed include homologous recombination of the 
viral genomes, plaquing of viruses in agar overlay , 
methods of measuring signal generation, and the llke 0 
For example, following the construction and 

30 assembly of the desired minigene-containing plasmid 
vector pAdCBLac#, the E1/E4 esspressing packaging cell 
line of this invention is infected with the helper virus 
H5dll004o The infected cell line is then subsequently 
transfeeted with an adenovirus plasmid vector by 

35 conventional methods « Homologous recombination occurs 



WO 96/39530 




PCT/US96/10245 



20 

between the E4-deleted H5dll004 helper and the ph&CBL&cZ 
vector, which permits the adenovirus~transgene sequences 
in the vector to be replicated and packaged into virion 
capsids, resulting in the r&do About 30 or More hours 
5 post -transfeet ion , the cells are harvested, an extract 
prepared and the r&d containing the L&cZ transgene is 
purified by buoyant density ultracentrifugation in a CsCl 
gradient o 

XXX * Us<& of the Recombinant Wlzns In Gene Therapy 

10 The r&d containing the transgene produced by 

cooperation of the adenovirus vector and B4 deleted 
helper virus and packaging cell line, as described above, 
provides an efficient gene transfer vehicle which can 
deliver the transgene in a pharmaceutical composition to 

15 a patient in vivo or oar vivo and provide for integration 
of the gene into a Mammalian cello 

The r&d are administered to humans in a conventional 
manner for gene therapy and serve as an alternative or 
supplemental gene therapy for the disorder to which the : 

20 transgene is directed o h r&d of this invention ©ay be 
administered to a patient, preferably suspended in a 
biologically compatible solution or pharmaceutically 
• acceptable delivery vehicle o A suitable vehicle includes 
sterile saline o Other aqueous and non-aqueous isotonic 

25 sterile injection solutions and aqueous and non-aqueous 
sterile suspensions known to be pharmaceutically 
acceptable carriers and well known to those of skill in 
the art may be employed for this purpose o 

The rAd are administered in sufficient amounts to 

30 transfect the desired target cells, e 0 g 0 , muscle, liver, 
epithelial, etc 0 and provide sufficient levels of 
transfer and escpression of the transgene to provide a 
therapeutic benefit without -undue adverse or with 
medically acceptable physiological effects which can be 

35 determined by those skilled in the medical artSo 
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Conventional and pharmaceutically acceptable routes of 
administration include direct delivery to the muscle or 
other selected cell, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal, oral and other 
5 parental routes of administration <> Routes of 
administration may be combined, if desired e 

Dosages of r&d uill depend primarily on factors such 
as the condition being treated, the age, weight and 
health ©f the patient, and may thus vary among patients o 

10 For example, a therapeutically effective human dose of 
the rAd is generally in the range of from about 20 to 
about 100 ml of ©aline solution containing concentrations 
of from about IsslO 9 to IsslO 11 pfu/ml virus <, A preferred 
human dose is estimated to be about SO ml saline solution 

15 at 22S10 10 pfu/ml o The dose will be adjusted to balance 
the therapeutic benefit against any side effects „ The 
levels of expression of the transgene can be monitored to 
determine the frequency of administration „ 

An optional method step involves the oo~ 

20 administration to the patient, either concurrently with, 
or; before or after administration of the rAd of a 
suitable amount of a short acting immune modulator « The 
selected immune modulator is defined herein as an agent 
capable of inhibiting the formation of neutralizing 

25 antibodies directed against the recombinant vector of 

this invention or capable of inhibiting or substantially 
delaying cytolytic T lymphocyte (CTL) elimination of the 
vector o Among desirable immune modulators are 
interleukin~12 [European Patent Application Moo 441,900]; 

30 gamma interferon [So Co EJorris et al, Jo Immunol o „ 

151s 1047 (1994)]? interleukin-4 [United States Patent BJOo 
5,017,691]; antibody to the CD4 protein, such as anti-OKT 
3+ [see, e 0 go, US Patent BJOo 4,658,019] or antibody GKloS 
(ATCC Accession KOo TIB207) ? a soluble CD40 molecule or 

35 an antibody to CD40 ligand (Bristol-Myers Squibb Co) 



WO 96/39530 




PCT/US96/10245 



22 

[European patent application 555,880, published August 
18, 1993]? a soluble for® of B7 or an antibody to CD28 or 
CTL&4 [CTLM-Xg (Sristol-Eiyers Squibb Co) , European 
patent application SOS, 217, published July 20, 1994], or 
S agents such as cyclosporin & or cyclophosphamide o Thus, 
the pharmaceutical compositions and Methods of this 
invention provide a desirable gene therapy treatment o 
Wo Recombinant &&eno~&S8oc±®t<8& Virus 
In the following contesst the term w transgene 00 means 

10 a nucleic acid sequence or reverse transcript thereof, 
heterologous to the MM sequence, which encodes a 
polypeptide or protein of interest o The transgene may be 
operatively linked to regulatory components in a manner 
which permits transgene transcription, io®o, the 

15 transgene is placed into operative association with a 

promoter, a© well as other regulatory sequences, such as 
SWSO introns or polyA sequences, useful for its 
regulation o The composite association of the transgene 
with its regulatory sequences is referred to herein as a 

20 minicassette or minigeneo 

The composition of the transgene or minicassette 
sequence will depend upon the use to which the resulting 
r&AW will be puto- For example, one type of transgene 
sequence includes a reporter sequence, which upon 

25 expression produces a detectable signal o Such reporter 
sequences include without limitation, an So coli B° 
galactosidase {LmcZ) cDWA, an alkaline phosphatase gene 
(ALP) and a green fluorescent protein gene 0 These 
sequences, when associated with regulatory elements which 

30 drive their expression, provide signals detectable by 
conventional means, e o g 0 , ultraviolet wavelength 
absorbance, visible color change, etCo 

Another type of transgene sequence includes a 
therapeutic gene which expresses a desired gene product 

35 in a host cello These therapeutic nucleic acid sequences 



WO 96/39530 




PCT/US96/10245 



23 

typically encode products for administration and 
expression in a patient in vivo or ex vivo to replace or 
correct an inherited or non-inherited genetic defect or 
treat an epigenetic disorder or disease. Such transgenes 
5 may be readily selected by one of skill in this art and 
the design of the transgene or the minicassette for 
insertion into the rAAV is not a limitation of this 
invention. 

The term "rAAV encompasses any recombinant AAV gene 
10 therapy vehicle of the prior art, including the AdAAV 

hybrid virus described in published International Patent 
Application No. W096/13598, published May 9, 1996. More 
specifically, rAAV defines a rAAV comprising: (a) the DNA 
of at least a portion of the genome of an AAV, which 
15 portion is capable of transducing into a target cell at 
least one selected gene in the absence of cell division; 
and (b) at least one selected gene (or transgene) 
operatively linked to regulatory sequences directing its 
expression, the gene flanked by the DNA of (a) and 
20 capable of expression in the target cell in vivo or in 
vitro. 

Other rAAVs have been described in the art. The 
method of this invention is not limited by the precise 
nature of the AAV sequences used in the rAAV, provided 

25 that at a minimum both the 5 9 and 3 1 AAV inverted 

terminal repeats are present. Thus, the rAAV may be 
selected by one of skill in the art, and is not itself a 
limitation on this invention. The rAAVs specifically 
disclosed herein are illustrative. 

30 By the term "transduction" is meant that the rAAV 

produced by practice of the invention is capable of 
infecting a desired target cell and expressing the 
transgene in the cell by harnessing the cell's machinery. 
Transduction may include stably integrating the viral DNA 

35 into a chromosome of the target cell. "Enhanced 
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transduction 03 is deff ined as the ability of the r&AV in 
the presence off a conversion agent to transduce the 
target cell, either in vitro , <&x vivo or in vivo g at an 
efficiency greater than a typical prior art rA&W produced 
5 in, and purified from, a culture co-infected with an 
adenovirus or herpesvirus helper <> 

This Method is based on the observation that the 
limiting step in rM? mediated transduction of cells for 
gene therapy is not the internalization or transfer of 

10 the ss viral genome, but rather the subsequent conversion 
of the single=stranded - (ss) viral genome to a 
transcriptionally active double-stranded (ds) fform<> 
Formation of ds D&& intermediates is necessary for 
recombinant gene esspression, which is likely to be 

15 modulated by viral and cellular factors through 

posttranscriptional Mechanisms o The inventors have 
designed a method to ©vercoae this rate^limiting step, 
thereby enhancing transduction ability of an" rA&V and 
ultimately the use of r&AV in gene therapy protocols , 

20 This method of the present invention may employ a 

conventionally prepared ss r&AV containing a transgenic 
The prior art produces ss rM¥ by co-inf ection in culture 
• with a helper adenovirus or herpesvirus, followed by 
purifying the rMV from the culture contaminants 

25 including the helper virus, and infecting the target cell 
with the r&AV alone o The present invention provides for 
infecting a target cell with a ss rAAVo . However, once 
the target cell is infected, the infected cell is 
contacted with an agent which facilitates the conversion 

30 of the ss r&AV to the ds form of r&AVo The action of 
this 00 facilitating agent ro or "conversion agent 00 causes 
the ss to ds conversion to occur in the target cell, 
resulting in enhanced transduction of the recombinant AAV 
into the target cello By facilitating the conversion of 

35 ss to ds r&AV in the target cell, the method of this 
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invention may also result in both transduction and stable 
chromosomal integration of the rAAV into the chromosome 
of said host cell. 

Preferably, for use of this invention the 
5 "facilitating or conversion agent" may take several 
forms. 

A. The Conversion Agent is a Helper Virus 
In one embodiment, the agent is a helper virus 
and the method includes an additional step of co- 

10 infecting the target cell with the helper virus. The 
helper virus useful in this method contains a selected 
gene which can facilitate the conversion of ss rAAV to ds 
rAAV. The selected gene may encode a gene product or 
polypeptide (or a functional fragment of the polypeptide 

15 which shares the biological activity of the full-length 
polypeptide) which enhances the conversion. 
Alternatively, the selected gene may express an antisense 
or ribozyme which functions in the cell to block or 
inhibit a cellular gene that normally prevents ss to ds 

20 conversion of the rAAV. These genes may also be employed 
in the second generation rAAV described below. 

The helper virus is capable of expressing the 
selected gene product in the target cell in the absence 
of cell division. The helper virus may be a wild-type or 

25 mutant adenovirus. The helper virus may alternatively be 
a wild-type or mutant herpesvirus. Preferably, for use 
as facilitating agents, such viruses are mutants deleted 
of several normal genes so that the helper viruses and/or 
their expressed gene products will not cause disease in a 

30 patient. 

For example, a helper adenovirus useful in this 
invention may express only a gene product of a single 
adenoviral early gene. Exposure of the ss rAAV to an Ad 
early gene product is sufficient to substantially enhance 
35 the formation of ds rAAV genome with a coordinate 



WO 96/39530 




PCT/US96/H02415 



26 

increase in transduction efficiency o The Ad early genes 
which are useful in producing this effect are El, E2a, E4 
and functional fragments thereof * However , as 
demonstrated by the examples below, adenovirus 
5 substantially enhances recombinant AAV. transduction in 
vitro in a nay that is dependent on expression of the SI 
and E4 genes of adenovirus and is directly proportional 
to the appearance of ds replicative forms of rAAVo 

One example of a helper virus is an adenovirus 

10 deleted of aost of its wild-type early genes and which is 
capable of expressing only its E4 gene or a functional 
fragment thereof in the target cello Among such 
functional fragments is the ORF <S of the E4 geneo As 
described beJ.ow in the examples, + experiments in cell 

15 lines indicate that the ORF6 of the adenoviral E4 gene 
locus is sufficient to significantly enhance rAAV 
transduction o Selective expression of the E4~ORFS 
product of adenovirus accomplishes a increase in 
transduction efficiency similar to, but somewhat 

20 attenuated, compared to that produced by exposure to the 
El and E4 gene products in combination „ That is, the 
ORFS product of E4 is sufficient to enhance the 
augmentation of rAAV transduction? but this effect is 
amplified substantially by El gene products <> 

25 Thus, more preferably, exposure of the rAAV to 

both the expressed El and E4 gene products produces a 
substantial enhancement of the above-described rate 
limiting stepo Therefore, another exemplary helper virus 
may also contain more than one gene which, upon 

30 expression, facilitates the ss to ds conversion „ An 
example of such a helper virus is an adenovirus which 
expresses both the El and E4 genes, or functional 
fragments thereof o Still other Ad genes may be expressed 
by the helper virus, provided that the virus is 

35 sufficiently crippled so that it does not cause disease 
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in the patient contributing the target cell©*, 

Where the agent which facilitates conversion of 
ss to ds rAAV is a helper virus , the method of the 
invention comprises co-infecting the target cell with the 
5 rAAV and the helper virus* Such co-infection may occur 
in the context of ex vivo therapy , ioe»^ Manipulations 
performed on cells extracted fro® the patient , which 
cells are reinserted into the patient after the method is 
performed o Alternatively, the patient may be directly 

10 co- infected with the two viruses by conventional means*, 
Delivery of the two viruses to the patient may be 
directed to a specific organ , or to the general 
circulatory system <> Such delivery methods are described 
in the art for gene therapy of, « 0 go A cystic fibrosis 

15 [see, @og. ; US Patent KfOo 5,240,84®] o 

So The Conversion Agent is © Chemical? Drug 
or Other Entity that can Activate r&AW Transduction 

In another embodiment of the method of this 
invention, the conversion agent which contacts the cells 

20 infected with the rAAV may be selected from the following 
classes of known compounds or methods s 1) inhibitors of 
DEJA synthesis such as hydroxyurea, hydrogen peroxide, and 
other direct or indirect inhibitors of DMA polymerase? 2) 
chemo-therapeutic agents that induce DMA damage, such as 

25 cyclophosphamide, alley lating agents, purine analogs, 

e 0 g 0 , 6-thioguanine, etc,*? 3) drugs that interfere with 
DJJA modifying enzymes, such as inhibitors of 
topoisomerase, DMA ligase exonucleases and endonucleases; 
and 4) agents that nonspecif ically enhance transcription, 

30 such as sodium butyrate, or agents that stabilize cells, 
such as DMSOo Also, genotoxic agents such as carcinogens 
may be employed as the conversion agent o Other methods 
of inducing disruption or damage to DNA may also be 
useful as agents capable of facilitating ss to ds 

35 conversion of rAAV and maybe selected by one of skill in 
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the art, including physical Methods, such as irradiation . 
These ©lasses of compounds or nethods are believed to 
result in the conversion from ss to ds rAAVo 

According to this embodiment of the method of 
5 the invention, the rAAW is again produced conventionally, 
but not co-infected with a helper virus . The ss rA&V is 
infected into the target cell, and the infected cell is 
contacted by the agent in an appropriate Manner depending 
on the identity of the agent • These conversion enhancing 

10 agents can be employed in ox vivo treatment of the target 
cells infected by the rA&V by application directly to the 
cells o Such application can occur substantially 
simultaneously, or consecutively, with application of the 
rMV "gene therapy vehicle „"* For example, the infected 

15 target cell ©ay be subjected to one of the above-listed 
compounds or drugs for a desired time period • The 
parameters for contacting the infected cells with the 
©gent may readily be determined by one of skill in the 
arto These parameters will depend upon whether the 

20 method is performed ear vivo or in vivo* For example, the 
number of <bx vivo infected cells t© be treated will be 
considered for the dosage, and timing of such treatment • 
Similarly, the physical status of the patient 
can determine the parameters of delivery of the agent to 

25 the patient in vivoo The dosage and amount of the 

damaging agent may therefore be adjusted by one of skill 
in the arto 5Jhere the agents are typical 
chemotherapeutic drugs approved for use in humans or 
animals, such enhanced conversion of rAAV may also occur 

3d in vivo by the co-administration of the agent, i 0 e<>, the 
chemotherapeutic drug, and the rAAV gene therapy vehicle 
to the patient o According to this aspect of the 
invention, the chemotherapeutic. drug would be 
administered only when the rA&V is administered . 
35 Appropriate dosages and amounts of chemotherapeutic drugs 
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and recombinant gene therapy vehicles and means for 
determining such amounts are within the skill of the art. 
However, because the effect of the chemotherapeutic drug 
will enhance the ss to ds conversion of the rAAV and thus 
5 enhance its efficiency of transduction into the target 
cells, it is anticipated that lower dosages than the 
conventional dosages of either or both the drug and the 
rAAV could be effectively administered. 

C. Conversion Agent May be Part of the rAAV. 

10 In still another embodiment of this invention, 
a novel "second generation" rAAV may be designed to 
incorporate the conversion agent into the virus, so that 
both the transgene and the conversion agent are co- 
expressed in fche target _cell. . Such a novel recombinant 

15 adeno-associated virus comprises the following 
components: 

(a) the DNA of at least a portion of the 
genome of an adeno-associated virus which portion is 
capable of transducing at least two selected genes or 

20 functional fragments thereof into a target cell in the 
absence of cell division; (b) a first selected gene, 
i.e., the desired transgene, operatively linked to 
regulatory sequences directing its expression, and (c) a 
second selected gene, i.e., the "conversion gene" 

25 operatively linked to regulatory sequences capable of 
directing expression of said second gene. The 
"conversion gene" upon expression is capable of 
facilitating the conversion of the ss rAAV to its ds form 
upon expression. The first and second genes in this rAAV 

30 are flanked by the AAV DNA, preferably the 5' and 3» 

ITRs. An embodiment of such a second generation rAAV is 
provided schematically in Fig. 11. Its DNA sequence is 
provided in SEQ ID NO: 5. 

Another embodiment of such a novel rAAV may 

35 include more than one gene which upon expression has the 
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ability to facilitate conversion of ss to ds rAAV in the 
target cell. For example, the novel rAAV described above 
may also contain an additional selected gene operatively 
linked to regulatory sequences capable of directing its 
5 expression, the additional gene and said second 

••conversion" gene described above being capable of 
jointly facilitating the conversion of ss rAAV to its ds 
form upon expression of both the second and additional 
genes. In this rAAV, all three genes, i.e., the 

10 transgene, the second "conversion" gene and the 
additional gene are flanked by the AAV DNA. 

In one desirable embodiment of a novel rAAVm 
the AAV ITRs flank a selected transgene, and a conversion 
gene, which is the adenovirus E4 gene or a functional 

15 fragment thereof (e.g., the ORF6 sequence^. In another 
embodiment, the novel recombinant expresses three genes, 
the transgene, the adenovirus E4 gene or a functional 
fragment thereof and the adenovirus El gene or a 
functional fragment thereof. The E4 and El gene products 

20 expressed in the target cell with the transgene, together 
act to facilitate conversion of the ss to ds form of 
rAAV. 

In still another embodiment of the novel rAAV 
and its use, the regulatory sequences directing 

25 expression of the conversion gene, e.g., whether it be a 
single second gene or more than a single additional gene, 
may include an inducible promoter. Thus, expression of 
the conversion gene occurs only in the presence of an 
inducing agent. Many inducible promoters and companion 

30 inducing agents, e.g., steroids such as glucocorticoids, 
are known to the art and may be readily selected for 
incorporation into the rAAV and methods of this invention 
by one of skill in the art with resort to this 
description. 

35 The method of the invention employing such 
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ra second generation 03 r&AVs which carry at least one 
^conversion gene ra provides for inf ecting the target cell 
with this ©s rMVo Uhere the promoters directing 
expression of both the transgene and the conversion gene 
5 ©re constitutive, the infected target cell Machinery will 
direct the expression of the transgene product and 
conversion gene product o Co-esspression in the target 
cell of the transgene and the ^conversion gene 00 
facilitates the conversion of @s r&AW to ds r&AV in the 

10 cell, and increases the transduction efficiency, and 

perhaps stable chromosomal integration, without further 
method steps o 

?3hen the second generation xMM employed in the 
. method contains the Q conversion gene(s.)L ra under th<s 

15 control of inducible promoter (s) , the method is slightly 
altered . Following infection of the target cell by the 
r&&W ? the infected target cell is contacted with a 
suitable inducing agent, which triggers the inducible 
promoter to ra tum ©n ra production of the conversion gene 

20 product o EJhen the inducing agent is removed or stopped 9 
the esspression of the conversion gene product is °°turned 

Off 00 o 

As described above, any prior art rAAV 
containing a transgene for gene therapy may be used in at 

25 least one embodiment of the above methods o The sources , 
selection and assembly of the various components to 
generate the r&&V, including the novel. rAAV described 
above, are now conventional and readily accessible to one 
of skill in thisart, given the disclosure contained 

30 herein o Such methods employ conventional genetic 

engineering techniques [See, e 0 g 0 Sambrook et al, cited 
above] „ 

The novel rAAV viruses and the methods of this 
invention provide efficient gene transfer vehicles for 
35 somatic gene therapy and are suitable in pharmaceutical 
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compositions for ®x vivo applications and in vivo use« 
ftfhen rMV contain a therapeutic gene, @<,go, in place of 
the LmoZ tr&nsgene illustrated in the exemplary rAAV, 
AVoCKIV&ac£, . by use of the rAAV and the methods described 
5 herein, the therapeutic transgene can be delivered to a 
patient in vivo or ax vivo to provide for efficient 
transduction, and possibly stable integration, of the 
desired gene into the target cello Thus, these novel 
rAAV and the Methods described herein can be employed to 

10 correct genetic deficiencies or defects o The potential 
of AAV to efficiently integrate its genome into 
nondividing cells is currently being exploited in the 
development of gene therapies based on ex vivo 
transduction of ^hematopoietic stem cells o In vivo 

15 application of rAAV is primarily being developed for the : 
treatment of €F where purified stocks of virus are 
instilled into the airway to transduce the terminally 
differentiated epithelial cells of conducting airway o 
The methods and compositions described herein can be used 

20 with both types of gene therapy * Another condition 

suitable for such use includes transduction of the low 
density lipoprotein (LDL) receptor gene into hepatocytes 
for the treatment of familial hypercholesterolemia o One 1 
of skill in the art can generate any number of rAAV which 

25 can be used via the above methods for the treatment of 
these and other disorders » 

For ex vivo or for in vivo therapy, the rAAV 
may be used to infect the target cells by suspending the 
virus particles in a biologically compatible solution or 

30 pharmaceutical^ acceptable delivery vehicle <> A suitable 
vehicle includes sterile saline o Other aqueous and non- 
aqueous isotonic sterile injection solutions and aqueous 
and non-aqueous sterile suspensions known to be 
pharmaceutical^ acceptable carriers and well known to 
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those of skill in the art nay be employed for this 
purpose. 

The rAAV are administered in sufficient amounts 
to transfect the desired cells and provide sufficient 
5 levels of expression of the selected transgene to provide 
a therapeutic benefit without undue adverse, or with 
medically acceptable, physiological effects which can be 
determined by those skilled in the medical arts. 
Conventional and pharmaceutical^ acceptable routes of in 

10 vivo administration include direct delivery to the target 
organ, tissue or site, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal , oral and other 
parental routes of administration. Routes of 
administration may be combined, if desired. 

15 Dosages of the rAAV for the infecting step of 

the method will depend primarily on factors such as the 
therapeutic environment, i.e., ex vivo or in vivo; the 
condition being treated, the selected gene, the age, 
weight and health of the patient, and may thus vary among 

20 patients. A therapeutically effective dosage of the rAAV 
for ex vivo treatment will be based upon the multiplicity 
of infection, which is likely to range from between about 
1 to about 10 transducing particles/cell. A 
therapeutically effective human dosage of the rAAV for in 

25 vivo infection according to the present invention is 

believed to be in the range of from about 20 to about 50 
ml of saline solution containing concentrations of from 
about 1 x 10 7 to 1 x 10 10 transducing viral particles/ml 
virus. A preferred human dosage is about 20 ml saline 

30 solution at the above concentrations. The dosage will be 
adjusted to balance the therapeutic benefit against any 
side effects. The levels of expression of the selected 
gene can be monitored to determine the selection, 
adjustment or frequency of dosage administration. 
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The effective amount of the facilitating agent 
to be administered is within the skill of the art to 
determine and will depend upon the identity of the agent. 
Known dosages of certain of the classes of chemicals and 
5 pharmaceuticals described above may be employed in this 
method to damage the DNA and facilitate ss to ds 
conversion of the rAAV. Where the agent is a gene 
expressed by a helper virus, the amounts of infecting 
virus should be similar to those amounts described above 

10 for the rAAV. Of course, where the agent is a gene 
present in a second generation rAAV, the identical 
dosages described above for the rAAV will apply. 

Several embodiments of the above-described 
methods of this invention were confirmed in murine models 

15 of rAAV mediated gene transfer to both lung and liver. 
These experiments demonstrated similarly low levels of 
gene transfer in vivo by rAAV, which was increased 
several orders of magnitude by coinf ection with El and E4 
expressing adenovirus. 

20 In summary, experiments were conducted to 

demonstrate that adenovirus enhances rAAV transduction in 
cultured cells. During the production and 
characterization of a lacZ recombinant AAV generated in 
293 cells that were coinf ected with an El deleted virus, 

25 it was observed that purification of rAAV from lysates 
was associated with substantial loss of lacZ transducing 
activity when assayed on 293 cells. This drop in rAAV 
activity was particularly evident in the final step where 
residual contaminating helper adenovirus was removed by 

30 heat inactivation. LacZ transducing activity was 

recovered by adding adenovirus back to the purified stock 
of rAAV. These data provided the first indication that 
adenovirus could substantially enhance the transduction 
efficiency of rAAV. 



WO 96/39530 




PCT/US96/10245 



35 

As described in Example 3.0 , a series of 
complementation groups we generated by mixing different 
adenovirus early gene Mutants with purified L&c% r&AV, 
referred to as KVoCmLacS (see Example 2) o These defined 
5 mixtures of viruses were analysed for h&cZ transduction 
on Mela cells (See Examples 12 and 13) o to 11 deletion 
r&d HSoSB&LP and the E4 deletion mutant d!1004 provided 
no significant increase in &VoCKR£&©c£ transduction (Fig<> 
Bh) o However , partial activity could be achieved with El 

10 and E4 Mutants that carried less severe deletions o Both 
dlllO (ElB-55kBa deleted) and dllOlO (ORFS deleted) 
enhanced transduction to levels that approached those of 
&d5, ts!25, and d!802 in terms of the number of positive 
blue cells , but total B-galactosidase activity was 

25 substantially lower (Fig 0 5&) <> These results implicate - 
early regions El and E4 in the augmentation of r&AV 
transduction 

The experiments described below also 
demonstrate that the novel r&AW which incorporates as its 

20 conversion gene, an Ad gene, such as E4, can increase 
transduction efficiency of the rAAV in the absence of a 
helper virus o As described in more detail below in 
Example 15 below, '293 cells were stably transfected with 
a genomic fragment of &d5 spanning E4o This E1/E4 

25 expressing cell line and the parent El expressing cell 
line (293) were infected with r&AV and analysed for 
transduction o These experiments demonstrated the 
significance of the combined expression of SI and 
S4(0RFS) in the adenovirus mediated augmentation of rAAV 

30 transduction o 

In the presence of El and E4 expression, rA&V 
transduction was invariably accompanied by the appearance 
of ds RF monomers and dimers (Example 14) o Importantly, 
the tight correlation between r&AV vector transduction 

35 and the accumulation of duplex forms could be achieved in 
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tno different experimental settings; cells infected nith 
E1/E4 expressing adenovirus (Figs* 8h and 8B) , or 
complementing cell lines (Fig<> 3C) • 

The following example® illustrate the 
5 construction and testing of the novel packaging cell 

lines Q the EI/134 deleted r&d of the present invention and 
the use thereof, improved methods and second generation 
recombinant production for gene therapy of the 
present invention <> These examples are illustrative only, 
10 and do not limit the scope ©f the present invention o 
Example 1 - Hovel Ela /Elb and B4 Express ing Packaging 
Cell Lines 

&o Construction of S4 OEF <$ Expressing Pl&smi&s 
The entire 34 region from Ad5 or an QRF6 

15 minigene ^ere subeloned into a shuttle plasmid that 

contained a neomycin resistance geneo Two versions of 
ORFS minigene ^ere developed that differed in the 
promoter element o The first used a Zn+2 inducible sheep 
metallothionine (Ear) promoter to drive QBF 6 expression o 

20 The second used a dexamethasone° inducible mouse mammary 
tumor virus (MTV) promoter o 

Mi exemplary plasmid useful for the 
• construction of a packaging cell line of this invention 
is pMMTVE40RFSo The minigene contained in this plasmid 

25 is set out in SEQ ID BJOs 1, and contains a mouse mammary 
tumor virus promoter (E3B2TV) (nucleotides 1-1506 of SEQ ID 
MO si) in transcriptional control of a human E4 ORF 6 gene 
sequence (nucleotides 1523-2408 of SEQ ID BJOs 1), a 
growth hormone terminator (GH) (nucleotides 2409=3654 of 

30 SEQ ID SJOs 1) , an SV40 origin of replication, plasmid 
sequences from plasmid pBR322, including a neomycin 
resistance gene, and an ampicillin resistance geneo The 
amino acid sequence of ORF 6 is indicated in SEQ ID NOs 
2o The various functional fragments of this plasmid may 

35 be readily replaced uith other conventionally used 
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sequences and are not critical to the design of the 
plasmid. 

Another plasmid useful for the construction of 
a packaging cell line of this invention is pMTE40RF6. 
5 The DNA sequence of the minigene contained in this 

plasmid is similar to that of SEQ ID NO: 1, except that 
the promoter is a sheep metallothionine promoter (MT 
promoter) [M. 6. Peterson et al, cited above]. 

A plasmid used as a control for the 
10 construction of a packaging cell line of this invention 
is pLTR.E4(-). This plasmid contains the endogenous 
constitutive retroviral MLV LTR and most of the Ad E4 
gene region except that the endogenous E4 promoter and a 
portion of E4 ORF1 are missing. The other plasmid 
15 sequences remain the same as described above. 

Still another plasmid useful for the study of 
the methods of this invention is pLTR . E4 , which contains 
the constitutive MLV LTR and endogenous E4 promoter and 
an intact E4 gene. The other plasmid sequences remain 
20 the same as described above. 

To determine whether 0RF6 expression was 
sufficient to enhance rAAV transduction, the inducible 
metallothionein (MT)-ORF6 minigene was stably transfected 
into HeLa cells. This new cell line, HeLa (MT— ORF6) was 
25 evaluated for LacZ rAAV transduction in response to ORF6 
induction as described below. The cell line 293 
(MT-0RF6) expresses ORF-6 of the E4 gene of Ad5 from the 
metallothionine promoter which is relatively inactive at 
baseline but can be induced with divalent cations. These 
30 293 cells were included to establish the baseline 
transduction efficiency. 

B. Transfections and Selection of Clones 

Each of the above-described plasmids was 
transfected. by the calcium phosphate precipitation 
35 technique into the human embryonic kidney cell line 293 
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[ATCC CRL1573] which expresses the product of the 
adenovirus El genes, or into HeLa cells, seeded on 100 mm 
plates (10 M9 plasmid/plate) . Twenty four hours post- 
transfection, cells were harvested and seeded at varying 
5 dilutions (1:10 - 1:100) in 100 mm plates for about 10 
days. Seeding media contain 6418 (Genet icin, BRL) at 1 
mg/ml. Resistant colonies that developed were selected 
using the following assays and expanded. Preliminary 
analysis of clones was based on enhanced transduction 

10 efficiency of a recombinant adeno-associated virus, 

AV.CMV.LacZ, and immunofluorescence localization of Ad E4 
protein as described in the following examples. 
Example 2 - Recombinant AAV and AV.CMBI.acZ Transduction 
Enhancement Assay 

15 El and E4 Ad gene products are needed for 

recombinant adeno-associated virus (AAV) function. This 
primary assay involves seeding the packaging cell lines 
of Example 1 in 96 well 35 mm culture plates (2x10* 
cells/well) and infecting the cells with purified, heat- 

20 treated AV.CMVLacZ at an HOI of 1000 virus 
particles/cell. 

A. Preparation of Recombinant AV.CMVLacZ 

A recombinant AAV virus was prepared by 
conventional genetic engineering techniques for the 

25 purposes of this experiment. Recombinant AAV was 

generated by plasmid transf ections in the presence of 
helper adenovirus [Samulski et al, J. Virol. . 63:3822- 
3828 (1989) J. A cis-acting plasmid pAV.CMVLacZ [SEQ ID 
NO: 4] (see Fig. 10) was derived from psub201 [Samulski 

30 et al, J. Virol.. £1:3096-3101 (1987)] and contains an E. 
coli 6-galactosidase minigene in place of AAV Rep and Cap 
genes. The 5' to 3' organization of the recombinant 
AV.CMVLacZ genome (4.9 kb) [SEQ ID NO: 4] includes 

(a) the 5» AAV ITR (bp 1-173) was 

35 obtained by PCR using pAV2 [C. A. Laughlin et al, Gene . 
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23: 65-73 (1983)] as template [nuc. 53-219]; 

(b) a CMV immediate early 
enhancer /promoter [Boshart et al, £sll, 11:521-530 
(1985) ] (nuc. 246-839) ,* 
5 (c) an SV40 intron (nuc. 856-987) ; 

(d) E. coli B-galactosidase cDNA (nuc. 

1039-4512) ; 

(e) an SV40 polyadenylation signal (a 237 
Bam Hl-Bcll restriction fragment containing the 

10 cleavage/poly-A signals from both the early and late 
transcription units (nuc. 4522-4719) and 

(f) 3' AAV ITR, obtained from pAV2 as a 
SnaBI-Bglll fragment (nuc. 4759-4925). All other 
nucleotides are plasmid derived. 

15 Rep and Cap genes were provided by a trans- 

acting plasmid pAAV/Ad [Samulski et al, cited above]. 

Monolayers of 293 cells grown to 90% conf luency 
in 150 mm culture dishes (5xl0 7 cells/plate) were 
infected with H5.CBALP at an NOI of 10. H5.CBALP (also 

20 called H5.010ALP) is a rAd that contains an alkaline 

phosphatase minigene in place of adenovirus Ela and Elb 
gene sequences (map units 1-9.2 of the Ad5 sequence of 
GenBank [Accession No. H73260]). The alkaline 
phosphatase cDNA is under the transcriptional control of 

25 a CMV-enhanced B-actin promoter in this virus. This 
helper virus is described in Goldman et al, Hum. Gene 
Ther. . 6:839-851 (July, 1995); Engelhardt et al, Hum. 
Gene Ther . . 5:1217-1229 (October, 1994); and references 
cited therein. 

30 Infections were done in Dulbecco's Modified 

Eagles Media (DMEM) supplemented with 2% fetal bovine 
serum (FBS) at 20 ml media/ 150 mm plate. Two hours post- 
infection, 50 ng plasmid DNA (37.5 fig trans-acting and 
12.5 fig cis-acting) in 2.5 ml of transfection cocktail 

35 was added to each plate and evenly distributed. 
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Transf ections Here calcium phosphate based as described 
[Bo Cullen, Ktetho Bnzvmol. ,, i££s684-704 (1987) ]c Cells 
Here left in this condition for 10-14 hours after Hhich 
the infection/transfection media Has replaced nith 20 ml 
5 fresh DMEBJ/2% FBS. Forty to fifty hours post- 

transf ect ion, cells Here harvested, suspended in 10 mB2 
Tris-Cl (pH 8o0) buffer (0 o 5 ml/150 mm plate) and a 
lysate prepared by sonication<, The lysate Has brought to 
10 manganese chloride, after Hhich bovine pancreatic 

10 DHase I (20,000 units) and HWas© (0o2 mg/ml final 

concentration) Here added, and the reaction incubated at 
37 ®C for 30 minutes o Sodium deoxycholate Has added to a 
final concentration of 1% and incubated at 37® C for an 
additional 10 minutes „ 

15 The treated lysate Has chilled on ice for 10 

minutes and solid CsCl added to a final density of lo3 
g/mlo The lysate Has brought to a final volume of SO ml 
nith lo3 g/ml CsCl solution in 10 mH Tris-Cl (pH 8 o 0) and 
divided into three equal aliquotSo Each 20 ml sample Has 

20 layered onto a CsCl step gradient composed of tno 9„0 ml 
tiers nith densities lo45 g/ml and loSO g/mlo 

Centrifugation Has performed at 25,000 rpm in a 
• Beckman SW-28 rotor for 24 hours at 4°Co One ml 
fractions Here collected from the bottom of the tube and 

25 analyzed on 293 or 293 (£4) cells for L&cZ transduction** 

Fractions containing peak titers of functional AV.GMVZ&cZ 
virus Here combined and subjected to three sequential 
rounds of equilibrium sedimentation in CsClo Rotor 
selection included a Beckman HVT-90 (80,000 rpm for 4 

30 hours) and SW-41 (35,000 rpm for 20 hours) . At 

equilibrium, hVoCmLacZ appeared as an opalescent band at 
Io40-lo41 g/ml CsClo Densities Here calculated from 
refractive index measurements o Purified vector Has 
exchanged to 20 mH HEPES buffer (pH7o8) containing 150 mM 

35 NaCl (HBS) by dialysis and stored frozen at -80 °C in the 
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presence of 10% glycerol or as a liquid stock at -20°C in 
HBS/40% glycerol. 

Purified virus was tested for contaminating 
H5.CBALP helper virus and AV.CMVLacZ titers. Helper 
5 virus was monitored by histochemical staining for 
reporter alkaline phosphatase activity. A sample of 
purified virus representing 1.0% of the final product was 
added to a growing monolayer of 293 cells seeded in a 60 
mm plate. Forty-eight hours later, cells were fixed in 

10 0.5% glutaraldehyde/phosphate buffered saline (PBS) for 
10 minutes at room temperature, washed in PBS (3x10 
minutes) and incubated at 65 °C for 40 minutes to 
inactivate endogenous alkaline phosphatase activity. The 
monolayer was allowed to cool to room temperature, rinsed 

15 once briefly in 100 mM Tris-Cl (pH9.5)/100 mM NaCl/5mM 
MgCl, and incubated at 37°C for 30 minutes in the same 
buffer containing 0.33 mg/ml nitroblue tetrazolium 
chloride (NBT) and 0.165 mg/ml 5-bromo-4-chloro-3- 
indolylphbsphate p-toluidine salt (BCIP) . Color 

20 development was stopped by washing the monolayer in 10 mM 
Tris-Cl (pH 8.0) /5 mM EDTA. Routinely the purification 
scheme described above removed all detectable H5.CBALP 
helper virus by the third round of buoyant density 
ultracentr if ugation • 

25 AV.CMVLacZ titers were measured according to 

genome copy number (virus particles/ml) , absorbance at 
260 nm (A^ particles/ml) and LacZ Forming Units 
(LFU/ml) . Virus particle concentrations were based on 
Southern blotting. Briefly, a sample of purified 

30 AV.CMVLacZ was treated with capsid digestion buffer (50 
mM Tris-Cl, pH 8.0/1.0 mM EDTA, pH 8.0/0.5% 
SDS/Proteinase K 1.0 mg/ml) at 50°C for one hour to 
release virus DNA. The reactions were allowed to cool to 
room temperature, loading dye was added and 

35 electrophoresed through a 1.2% agarose gel. Standard 
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quantities of ds AV.CMVLacZ genome were also resolved on 
the gel. 

DNAs were electroblotted onto a nylon membrane, 
hybridized with a random primer labeled restriction 
5 fragment, and the resulting blot scanned on a 

Phosphor Imager 445 SI (Molecular Dynamics) . A standard 
curve was generated from the duplex forms and used to 
extrapolate the number of virus genomes in the sample. 
LFU titers were generated by infecting indicator cells 

10 with limiting dilutions of virus sample. Indicator cells 
included HeLa and 293 and 293 (E4) lines (described in 
Example 10 below) • Twenty-four hours later, cells were 
fixed in glutaraldehyde and cells were histochemically 
stained for E. coli 0-galactosidase (LacZ) activity as 

15 described in J. M. Wilson et al, Proc. Natl. Acad. Sci. 
XZ§&, £5:3014-3018 (1988). One LFU is described as the 
quantity of virus that is sufficient to cause visually 
detectable B-galactosidase expression in one cell 24 
hours post-infection. 

20 B. Induction of ORF6 Expression 

Induction of 0RF6 expression with 10 MM 
dexamethasone or 150/iM zinc sulfate (for negative 
control, no inducer used) was initiated 2 hours before 
the addition of virus and continued throughout the 

25 duration of the experiment. Twenty-four hours after the 
addition of virus, cells were harvested, lysates were 
generated by sonication and analyzed for the B- 
galactosidase expression (i.e., B-galactosidase activity) 
and virus DNA as described above. Hirt extracts were 

30 prepared from low molecular weight DNA from cell 

extracts. The preparation of the Hirt extracts and 
subsequent analysis by Southern hybridization were 
performed by resort to conventional procedures known to 
one of skill in the art. 

35 in the absence of the inducers, the packaging 
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cell lines generate lower levels of B-galactosidase in 
rAAV infected cells. Induction of ORF6 expression with 
the inducer dexamethasone results in a concomitant rise 
in AV.CMYLacZ cell transduction to a level that was much 
5 greater than the parent 293 line. Expression of El alone 
was insufficient to have an effect in the adenovirus 
mediated augmentation of rAAV transduction. 

Results are demonstrated for certain positive 
clones in the Table I below (see Example 4). However, 

10 for 30 cell lines having an MMTV promoter and ORF6 

sequence, 4 demonstrated over 90% blue cells illustrative 
of LacZ production in the presence of dexamethasone, 
i.e., 293-27-6, 293-27-17, 293-27-18 and 293-27-28. 
Example 3 - Immunoflu orescence Localization of Ad5 Late 

15 protein. 

Positive clones from the assay of Example 2 were 
infected with the recombinant E4 deleted adenovirus 
H5dll004 and screened for E4 complementation using an 
immunofluorescence assay for late gene expression. The 

20 H5dll004 virus was obtained from Dr. Ketner of Johns 

Hopkins University and is described in Bridge and Ketner, 
J. Virol, . £22.(2) : 631-638 (Feb. 1989), incorporated by 
reference herein. Because ORF6 of E4 complements late Ad 
gene expression, specifically in the formation of the 

25 hexon and penton fibers of the adenovirus, cell lines 
containing ORF6 are able to bind with antibody against 
these proteins. 

Each cell line of Example 1 is infected with E4 
deleted virus H5dll004 virus at an MOI of 0.1. The cells 

30 were treated with mouse anti-adenovirus FITC-labeled 

monoclonal antibody to either the hexon or penton fibers 
in a 1:10 dilution (Chemicon International Inc., 
Temecula, CA) • Positive clones were identified by 
reaction with the antibody. 
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Example 4 - Relative Plaguing Efficiency 

The cell lines of Example 1 demonstrating with 
strong complementation ability in Example 3 were screened 
for relative plaquing efficiency of H5dll004 as compared 
5 to W162 cells (an E4 -complementing Vero cell line which 
does not express El) [Weinberg and Ketner, Proc. Watl« 
Acad, Sci. USA . 80 (1?) : 5383-5386 (1983) ]• In Table II 
below, RPE%, i.e., relative plaquing efficiency, 
represents the titer of H5dll004 on tested cell 

10 lines/ titer of H5dll004 on W162 cells. For example, the 
RPE of 293 cells is 0. 

The positive cell lines selected by all criteria are 
identified in Table I below, with the results of the 
assays of Examples 2, 3 and 4. 

15 TABLE I 



E1/E4 Double Complementing Cell Lines 



Cell 
Line 


Trans- 
Gene 


Pro- 
moter 


AV.CMV 

IF/LP LacZ 


RPE% 


293-10-3 


0RF6 


MT 


■ » i i iiii 
TTTT TTTT 


246 


293-39-11 


ORF6 


LTR 


++++ +++ 


52 


293-84-31 


E4- 


LTR 


1111 A 1 1 ,1 

TTTT TTTT 


179 


293-12-31 


whole 
E4 


LTR 
+E4 


++++ ++++ 


174 


293-27^6 


0RF6 


MMTV 


+++++ 


327 


293-27-17 


0RF6 


MMTV 
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339 


293-27-28 


0RF6 


MMTV 
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Example 5 - Construction and Purification of HS.OOlCBLacZ 
30 The plasmid pAd.CBLacZ was constructed as described 

in detail in K. Kozarsky et al, Som. Cell Mol. Genet. . 
12(5): 449-458 (1993), incorporated by reference herein. 
This plasmid contained a minigene comprising a 5* 
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flanking Nhel restriction site, followed by Ad5 sequence 
m.u. 0-1, followed by an El deletion into which is 
inserted a CMV enhancer/ chicken 6-actin promoter sequence 
[T. A. Kost et al, Nucl. Acids Res. , H(23):8287 (1983)], 
5 which controls the transcription of the following 

bacterial B-galactosidase, followed by a poly A sequence 
and flanked 3 9 by Ad m.u, 9-16, and another Nhel site. 
In the plasmid, the minigene was flanked on both sides by 
plasmid sequence containing drug resistance markers. 

10 The plasmid pAd.CBLac2 was linearized with Nhel and 

co-transf ected by the calcium phosphate co-transf ection 
method into the novel packaging cell line of Example 1 
with Clal digested H5dll004 (an Ad5 sequence deleted of 
from about map unit 92.1 through map unit 98, 

15 corresponding to substantially the entire E4 gene) . 

Homologous recombination occurs in the cell line 
between these two viral constructs between Ad map units 
9-16,,, resulting, in rAd r , designated^. OOlCBLacS [SEQ ID 
NO: 3] (Fig. 2). This rAd contains the sequence from 

20 pAd.CBLacZ (including Ad map units 0-1 (nuc. 1-330); CMV 
enhancer/ chicken 6-actin promoter (CB) (nucs. 370-928) ; 
E. coli B-galactosidase (nucs. 945-4429); the polyA (nuc. 
4429-4628); and Ad5 map units 9-92.1 and 97.3 to 100 from 
H5dll004 (nucs. 4671-35408)). This rAd is thereby 

25 functionally deleted, and substantially structurally 
deleted, of the Ad El and E4 genes. 

Viral plaques were selected and screened by the B- 
galactosidase assay [Wilson (1988) , cited above] and 
H5.001CBLac£ was isolated following three rounds of 

30 plaque purification. The purified virus was also 

subjected to cesium chloride density centrifugation and 
large scale production. 

For the following mouse experiments, virus was used 
after column purification and glycerol was added to a 
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final concentration of 10% (v/v) . Virus was stored at - 
70 °C until use. 

Example 6 - Growth Kinetics of H5 . OOlCBLacZ in Packaging 
Cell Lines 

5 The cell lines identified in Table I were infected 

with recombinant H5.001CB.LacZ at an MO I of .5. The 
growth kinetics of this virus in the E4 complementing 
cell lines are shown in Fig. 3. Maximum viral yield is 
reported as LFU/ml in Table II below. 
10 TABLE II 



Cell Line 


Maximum Viral 


293-10-3 


2 . 8X10 10 


293-39-11 


9.5X10* 


293-84-31 


1.1X10 9 


293-12-31 


4 . 5X10 8 


293-27-6 


2.8X10 10 


293-27-17 


2.5X10 10 


293-27-18 


2^9xl0 10 _ 


293-27-28 


1.2X10 10 



20 When grown in 293-27-18 cells (the E4 ORF6 cell line 

with MMTV promoter inducible by dexamethasone) the 
maximum yield of this virus is 2.9X10 10 LFU/ml. Several . 
of the cell lines were passaged between 5 and 20 times 
and the viral production of the passages remained stable. 

25 However, RPE did fall following repeated passages of 
cells. 

Example 7 - Other Recombinant Adenoviruses 

Other related rAds were prepared similarly to 
HS.OOlCBLacZ by homologous recombination between 
30 pAdCBLacZ and other helper viruses. 

As one example, HS.OOOCBLacZ is a recombinant El 
deleted Ad5 which contains the same minigene as 
H5.001CB.LacZ, but has an intact E4 gene. This rAd was 
prepared as described by homologous recombination between 
35 pA&CBLacZ and a wild-type Ad5. 
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As another example , H5.010CB.LacZ contains the 
adenovirus map units 0-1 , followed by a CMV enhanced , 
chicken cytoplasmic 0-actin promoter, the E. coli 6- 
galactosidase gene (lacZ) , a polyadenylation signal (pA) , 
5 and adenovirus type 5 map units 9-100, with a small 

deletion in the E3 gene (the Ad 5 sub360 backbone). This 
rAd may be prepared by homologous recombination between 
the pAdCBLacZ vector and Ad5 virus sub360, which contains 
a 150 bp deletion within the 14,6 kD protein of the E3 

10 gene. See, e.g., J. F. Engelhardt et al, Proc. Natl. 

Acad. Sci. . USA , 91:6196-6200 (June 1994); and Engelhardt 
et al, Hum. Gene Ther. . 5:1217-1229 (Oct. 1994), both 
incorporated by reference herein. 

These rAds were isolated following transfection 

15 [Graham, Virol. , 52:456-467 (1974)], and were subjected 
to two rounds of plague pur if ication. Ly sates were 
purified by cesium chloride density centrifugation as 
^previously described [Englehardt et al. Proc. Natl. Acad. 
Sci. USA . 88:11192-11196 (1991)]. Cesium chloride was 

20 removed by passing the virus over a BioRad DG10 column 
using phosphate-buffered saline. 
Example 8 - LacZ Gene Transfer into Mouse 
A. Transfer into Mouse Muscle 

Five to six-week old male C57B/6 mice were 

25 anesthetized. Anterior tibialis muscles were exposed and 
directly injected with either rAd H5. OOOCBLacZ, 
HS.OlOCBLacZ or H5. OOlCBLacZ as follows: 25 /iL of 
purified viral suspension at a stock concentration of 5 x 
10" virus particles /mL was injected by inserting the tip 

30 of the 33 gauge needle of a 100 /xL Hamilton syringe into 
the belly of the muscle. 

Animals were sacrificed on day 4, 14, 28 and 60 
post injection. The muscles were dissected and frozen in 
liquid nitrogen cooled isopentane. Six sections were 

35 cut in a cryostat, fixed and stained for B-galactosidase 
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activity for 6 hours at 37 °C. 

While the blue stained rAd was found for each 
virus in the day 4 and day 14 (most abundant) stains, by 
day 28 , the H5.001CBLac£ clearly demonstrated more virus 
5 on day 28. By day 60 , the only virus which stained 
positive was the H5.001CBLac2. 

B. Transfer into Mouse Lung and Circulation 
RAd H5.000CB.LacZ (control), and HS.OOlCBLacZ 

(IxlO 11 viral particles) were administered to six week old 
10 C57BL/6 female mice by tail vein injection and trachea 

installation. The animals were sacrificed and their 

liver and lung tissues were harvested at days 4, 9, 21, 

28 and 35 post-administration. The transgene and viral 

late gene expression were compared. 
15 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 

points in comparison with transgene. 

C. Dose Responses in Liver 

Dose responses of E4 -deleted and E4 intact rAds 

20 in the liver of C57BL/6 mice were studied by tail vein 
administration of 1.5X10 11 , 5xl0 10 , 1.7xl0 10 , 5.6xl0 9 , and 
1.9xl0 9 viral particles and comparing the transgene and 
viral late gene expression at day 4, 21, 28, 35, and 42 
post administration. 

25 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 
points in comparison with transgene. 
Example 9 - Other Gene Transfers 
A. Human OTC Gene Transfer 

30 The human OTC gene [A. L. Horwich et al, 

Science. 224:1068-174 (1984)] or the human CFTR gene 
[Riordan et al, Science , 245:1066-1073 (1989)] was used 
to replace the LacZ as the transgene in the recombinant 
E1/E4 deleted adenoviruses described above, using the 

35 techniques analogous for the construction of the above- 
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described LacZ vectors. 

The resulting human OTC-containing rAd were 
administered at an MOI of 10 to 30 to human hepatocytes. 
The E1/E4 deleted rAd demonstrated less replication and 
5 less late gene expression than when the El/ £4 deleted 
rAds are administered to muscle, as described in the 
example above. However, the results of this gene 
transfer are better than comparable transfers with rAds 
containing only a deletion in the El gene or a deletion 
10 in the El gene and a point mutation in the E2a gene. 

Similar results are demonstrated when the 
transgene is CFTR and the method of administration is 
intratracheal into lungs. 

Example 10 - Transduc tion Effic iency of rAAV LacZ 
15 AV.CMVLacZl in HeLa Cells Infected with Ad Mutants 
A. Viruses 

The following viruses were employed in this 
experiment: 

(1) Wild-type Ad 5, propagated in 293 cells; 
20 (2) Ad dlllO (an Ad which is deleted of the 55 

kb E1B gene) [Babiss et al, J. Virol. . 52(2) :389-395 
(1984) and Babiss and Ginsberg, J. Virol. , 50(1) :202-212 
(1984)], propagated in 293 cells, 

(3) H5.CBALP (an Ad deleted of its E1A and E1B 
25 genes and containing a minigene that expresses alkaline 

phosphatase from a CMV enhanced j8-actin promoter, as 
described above) , propagated in 293 cells, 

(4) Ad tsl25 (an Ad with a temperature 
sensitive mutation in the E2A gene which encodes the DNA 

30 binding protein) [Ensinger and Ginsberg, J . Virol . , 
10(3) :328-339 (1972)], propagated in 293 cells, 

(5) Ad dl802 (an Ad deleted of its E2a gene) , 
grown in E2A-complementing gmDBP cells as described in 
Rice and Klessig, J. Virol. . 56 (3) :767-778 (1985); 
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(6) Ad dll004 (an Ad deleted of the E4 gene), 
grown in E4 -complementing Vero W162 cells [Weinberg and 
Ketner, Proc. Natl. Acad. Sci. USA . £0(17) :5383-53B6 
(1983)] and 

5 (7) Ad dllOlO (an Ad deleted of ORF6 of its E4 

gene) , grown in E4 -complementing Vero W162 cells 
[Weinberg and Ketner, cited above] . 

All viruses were purified by two sequential 
rounds of buoyant density ultracentrifugation in CsCl. 

10 B. Experimental Procedures 

HeLa cells seeded in 6 well, 36mm culture 
plates (2x10* cells/well) were infected with wild-type 
Ad5 or an adenovirus early gene mutant as described in 
Part A at an MOI of lOpfu/well. Infections were done in 

15 1.0 ml DMEM/2%FBS. Six hours post-infection, monolayers 
were washed and 1.0 ml fresh DMEM/2% FBS media containing 
AV.CMVLacZ at 4xl0 9 virus particles/ml were added. 
Although the AV.CMVLacZ virus lot used in these 
experiments was shown to be free of H5.CLALP helper virus 

20 by histochemical staining, the virus sample was subjected 
to heat treatment (60°C for 20 minutes) prior to use to 
ensure the absence of contaminating adenovirus. Two 
hours later, 1.0 ml of DMEH/115% FBS was added to each 
well. 

25 Twenty-four hours after the addition of 

AV.CMVLacZ, cells were harvested. Each test condition 
was done in triplicate to enable virus transduction to be 
evaluated in terms of three outputs: histochemical 
staining for fi-galactosidase activity (below) , 

30 intracellular fi-galactosidase specific activity (Example 

11) , and the molecular form of the virus DNA (Example 

12) . 

HeLa cells were histochemical ly stained for E. 
coli 0-galactosidase (LacZ) activity as described in J. 
35 M. Wilson et al, Proc, Natl. Acad. Sci. USA . M: 3014-3018 
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(1988) . The different combinations that were tested 
included cells transf ected with AAV vector alone 
(AV.CKVLacZ) , vector plus wild-type Ad5 (+Ad5) , vector 
plus dlllO (+dlllO) , vector plus Ad mutant H5.CBALP 
5 (+H5.CBALP) , vector plus Ad mutant tsl25 (+tsl25) , vector 
plus Ad mutant dl802 (+dl802) , vector plus Ad mutant 
dll004 ( -Kill 004) , and vector plus Ad mutant dllOlO 
(+dll010) . 

The results were observed in photomicrographs 

10 at magnification 10X (not pictured) of histochemical 
stains for recombinant fi-galactosidase activity. The 
results indicated that wild-type Ad5 and the E2a mutants 
tsl25 and dl802 caused a significant increase in LacZ 
rAAV transduction as measured by the number of positive 

15 blue cells and the degree of stain intensity. Both dlllO 
(ElB-55kOa) and dllOlO (0RF6) enhanced transduction to 
levels that approached those of Ad5, tsl25, and dl802 in 
terms of the number of positive blue cells. 

The El deletion recombinant H5.CBALP provided 

20 no significant increase in AV.CMViacZ transduction. 

Expression of El alone was insufficient to have an effect 
in the adenovirus mediated augmentation of rAAV 
transduction as evidenced by lack of significant increase 
in transduction obtained with HeLa cells infected with 

25 the E4 deletion mutant dll004. A significant drop in 

transduction occurred following removal of ORF6 from the 
E4 region from the coinfecting adenovirus (Fig. 5A) . 

It is believed that these results demonstrate 
that the adenoviral gene products, E4 and El indirectly 

30 promote the formation of ds DNA intermediates that are 
transcriptionally active. 

Example 11 - Quantitation of Enhanced Vector Transduction 

(A) A duplicate set of HeLa cells as described in 
Example 10B were used in this experiment. Twenty-four 
35 hours after the addition of AV.CMVLacZ recombinant, for 
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intracellular fi-galactosidase assays, cell pellets were 
suspended in 0.5 ml PBS and sonicated. Cell debris was 
removed by centrifugation (15,000Xg for 10 minutes) and 
the clarified extract assayed for total protein [M. 
5 Bradford, Anal. Biochem. , 7^(1-2) : 248-254 (1976) and M. 
Bradford et al, Fed. Proc . 25(3) :274 (1976)] and B- 
galactosidase activity [Sambrook et al, cited above] 
using o-nitrophenyl fi-D-galactopyranoside (0NP6) as 
substrate. 

10 Fig. 5A demonstrates the transduction 

efficiency quantitated by measuring 6-galactosidase 
enzyme activity in the lysates from infected Hela cells 
and also assayed for total protein. In Fig* 5A, the test 
condition is shown along the horizontal axis, and 

15 intracellular fi-galactosidase specific activity 

(milliunits/mg protein) using ONFG as substrate is 
plotted on the vertical axis. Below each bar, the 
fold-induction in specif ic activity relative to cells 
that received the AV.CMVLacZ vector alone is given. 

20 The results of Fig. 5A demonstrate that the E2a 

mutants tsl25 and dl802 produced 134-fold and 225-fold 
increases in 6-galactosidase activity, respectively, as 
compared to that achieved with purified rAAV alone. In 
comparison, cells infected with wt Ad5 generated 107-fold 

25 increase in B-galactosidase activity. 

(B) In another experiment, HeLa cells (2xl0 6 ) were 
infected with increasing multiplicities of wild-type Ad5 
or the E2 mutant dl802. Six hours post-infection, 
monolayers were washed and infected with AV.GMVLacZ at 

30 1000 virus particles/cell. Twenty-four hours after the 
addition of AV.GMVLacZ, cells were harvested and assayed 
for total protein and B-galactosidase activity. 

The results are illustrated in the bar graph of 
Fig. 5B, in which adenovirus MOI's are given along the 

35 horizontal axis, and intracellular fi-galactosidase 
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specific activity along the vertical axis*, Enhancement 
of r&AV transduction was proportional to input helper 
adenovirus fro® EJOIs of 1 to 50 for both wild type Ad5 
and d!802o Higher doses of virus were cytopathic, 
5 leading to a fall in fi»galactosidase expression o 

Enhanced transduction was achieved when the cells were 
infected prior to, or at the time of, r&AW infection <> 
The El deletion recombinant H5oCBAX*P and the S4 deletion 
mutant d!1004 provided no significant increase in 

10 KVoGmL&cZ transduction o Both cells infected with dlllO 
(ElB-55kBa) and with dllOlO (ORF6) demonstrated 
substantially lower total B-galactosidase activity than 
those infected with Ad5, ts!25, or d!802o 
Sample 12 - Analysis of Low Mole cular We iqht_JOH&g_JUa 

15 AVoCmLaaZ Transduced Cells 

Studies with these early gene mutants of adenovirus 
suggested that expression of adenoviral genes rather than 
the virion itself was responsible, for enhancement of r&AV 
transduction o To further investigate these mechanisms 

20 and to determine if conversion of ss to ds genome limits 
the transduction efficiency of rAAV, the molecular state 
of the rAAV genome was characterized in the infected 
cells o The relationship between RFm formation and' l&cZ 
rAAV transduction was explored in experiments where the 

25 dose of coinfecting virus was varied (M)X=1„ 5, or 10) o 
{A} & duplicate set of HeLa monolayers ©s 
described in Example 10 were harvested 24 hours after 
they were transduced with the recombinant &VoC8SV£ac£ and 
cultured with or without helper adenovirus. 

30 Episomal DNA was extracted from cell pellets 

using a modification of the procedure originally 
described by Bo Hirt, J. Mol. Biol» „ £6s365-369 (1967) . 
Briefly, cells were suspended in 320 ml Tris-Cl 
(pH8oO)/10 EDTA and SDS added to a final concentration 

35 of 1% 0 The mixture was incubated at 37 °C for 30 minutes . 
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Pronase and proteinase K were added to final 
concentrations of 500 Mg/nl and 20 pg/ml, respectively, 
and the reaction incubated at 37 °C for 2 hours. Sodium 
chloride was added to a final concentration of 1.1 M and 
5 incubated at 4°C overnight. The precipitate that 
developed during the 4°C incubation was pelleted at 
20,000 xg for 30 minutes and the clear supernatant 
carefully removed. The supernatant was extracted once 
with phenol: chloroform: isoamyl alcohol (25:24:1) followed 

10 by chloroform: isoamyl alcohol (24:1). Nucleic acids were 
precipitated with ethanol. The final pellet was 
suspended in 50 /il Tris-Cl (pH 8.0) /1. 0 mM EDTA. 

These Hirt extracts were analyzed by Southern 
blot hybridization. Samples (5 fil) of each Hirt extract 

15 were resolved through a 1.2% agarose gel, electroblotted 
onto a nylon membrane and hybridized with a 32 P random 
primer labeled cDNA of the SV40 polyA signal used in 
AV.CMVLacZ. 

An autoradiogram of the experiment of Example 

20 12 (not pictured) , identifies and labels bands 

corresponding to the ss AV.GMVLacZ genome (SS) , a monomer 
replicative form (RFm) , and concatomer replicative forms 
(RFd) • Bands corresponding to the ss AV.OWLacZ genome 
(SS) , a monomer replicative form (RFm) , and concatomer 

25 replicative forms (RFd) were identified and labeled. To 
reference the RFm band, a plasmid carrying AV.CMVLacZ was 
digested to release the entire genome. Autoradiogram 
exposure times were 14 hours and 69 hours. 

In this autoradiogram, the full spectrum of 

30 molecular species present during a lytic infection was 
demonstrated in cells infected with both LacZ rAAV and 
wild type adenovirus. Both the input ss genome (SS) and 
monomeric and dimeric forms of ds replicative 
intermediates (RFm and RFd) are present. This contrasts 

35 with cells infected with purified rAAV alone, where ss 
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genome is the sole molecular form detected. Analysis of 
cells coinf ected with the adenovirus early gene mutants 
revealed a direct correlation between formation of ds 
forms of the rAAV genome and the enhancement of LacZ 
5 transduction. Mutant adenoviruses that were ineffective 
in enhancing rAAV transduction (i.e., the El deleted 
mutant H5.CBALP and the E4 deleted mutant dll004) failed 
to promote the formation of ds forms of AAV. 

Cells infected with adenovirus deleted of E2a 

10 (dl802) or partially deleted of El (dlllO) or E4 (dllOlO) 
additionally demonstrated a band whose size was identical 
to the ds replicative monomer (RFm) of the lacZ rAAV 
genome and whose abundance correlated directly with the 
expression of fi-galactosidase activity (compare results 

15 of Example 14 to these described results) . Slower 

migrating concatomers, likely dimers, of duplex rAAV were 
also detected in the autoradiogram described above. 

In the presence of El and E4 expression, rAd 
transduction was invariably accompanied by the appearance 

20 of ds RF monomers and dimers. 

The high molecular weight band in sample lane 
+H5.CBALP is helper virus DNA. Helper virus DNA is 
recognized by the SV40 probe because the CBALP minigene 
also utilizes the SV40 polyA signal. 

25 (B) In another experiment, HeLa cells were infected 

with wt Ad5 or the E2 deleted mutant dl802 as described 
in Example 10B. Monolayers were harvested 24 hours later 
and analyzed for fi-galactosidase activity and RFm 
synthesis. Monomer bands similar to those shown in the 

30 autoradiogram described above were quant itated on a 
Phosphor Imager 445 SI and assigned values (CPM) • 

The results are illustrated in the graphs of 
Figs. 8 A and 8B, in which 6-galactosidase specific 
activity and CPM are plotted along the vertical axis of 

35 each figure. Adenovirus MOI's are given on the horizontal 
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axis of each figure. Data obtained from low MO I 
infections (1, 5, and 10) are shown. Importantly, the 
tight correlation between rAAV vector transduction and 
the accumulation of duplex forms could be achieved in 
5 cells infected with E1/E4 expressing adenovirus. The 
level of fi-galactosidase and abundance of RFm increased 
in proportion to the amount of infecting wild type Ad 
(Fig. 6A) and dl802 (Fig 6B) • These data suggest that 
synthesis of an episomal duplex intermediate is an 

10 obligatory event in transduction. 
Example 13 - Duplex End-Analvsis 

The following is a description of a model for 
leading strand synthesis of a complementary AAV strand in 
the presence of Rep (+Rep) or absence of Rep (-Rep) . 

15 Refer to Figs. 7A-7F. Rep expresses a terminal 

resolution activity that can convert a duplex structure 
with closed-ends to an open-ended duplex. In the absence 
of Rep, terminal resolution is impaired leaving the 
covalently closed, hairpin structures intact. Under 

20 these conditions, hairpins are expected to be found 

leftward and rightward, since both strands of a rescued 
ds AAV genome are packaged into virions. Figs. 7B-7F are 
a flow chart demonstrating the strategy for identifying 
the terminal structure of duplex RFm that is synthesized 

25 from ss AV.GMVLacZ in response to adenoviral gene 
expression. 

Fig. 7C illustrates a closed end and an open end 
fragment of rAV.GMVLacZ. Figs. 7D, 7E and 7F indicate 
the mixture of open-ended and covalently closed duplex 

30 fragments generated by NotI digestion at position 4509 in 
the absence of terminal resolution. The NotI 4509 
digestion provides a convenient means of releasing a 361 
bp fragment that contains the right ITR in the context of 
a hybridization target (i.e. SV40 pA) • In the presence 

35 of terminal resolution, only the open-ended 361 bp 
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fragment would be expected to be generated (Fig* 7D) by 
such digestion. 

The resulting electrophoretic gel (not pictured) , 
revealed in lane (1) the results of digestion of a 
5 plasmid carrying an AV.CMVLacZ cDNA to release the rAAV 
vector, and subsequent digestion with Not! to release the 
right terminal 361 bp fragment. In lane (2) a sample of 
NotI digested Hirt DNA extracted from HeLa cells infected 
with wild-type Ad5 and transduced with AV.CMVLacZ 

10 resulted in the release of two fragments, labeled FormI 
and Formll. (See, also, Figs. 8A and 8B) • The migration 
of ss AV.CMVLacZ (SS) and RFm were also seen. 

The ds AV.CMVLacZ intermediates that accumulated in 
cells infected with adenovirus were likely the result of 

15 leading strand DNA synthesis, initiating from the duplex 
region of the vector ITR. In the absence of Rep, this 
conversion event was anticipated to generate molecules in 
which one end is open and the other is covalently closed 
(Fig, 7A) . To further characterize the structure of this 

20 ds intermediate Hirt extracts from cells coinfected with 
rAV.CMBLacZ and Ad5 were digested with NotI to release 
the termini of the ds intermediate which, if left open, 
would be approximately 361 bp in length. The resulting 
filters were hybridized with a probe specific for the 

25 SV40 polyadenylation signal positioned immediately 

upstream of the rightward ITR. At least two forms were 
released from the right end of duplex genomes, one that 
migrated to a position in the gel that predicted an 
open-ended conformation (Form II) , and a second slower 

30 migrating species (Form I) . Although this result was 

consistent with the model (Figs. 7A-7F) , it was difficult 
to predict with certainty the structure of Form I. Its 
retarded mobility did, however, suggest a conformation 
that differed from the open-ended Form II. 
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Example 14 - Analysis of AV.CMVLacZ Transduction 
Efficiency in 293 Cells Stablv Transfected with an 
Inducible E4 ORF6 cDNA 

Cell lines used in this assay were prepared as 
5 described in Example 1. 293(MT-ORF6) cells and HeLa(MT- 
ORF6) cells were seeded in 6 well 35 nun culture plates 
(2x10* cells/well) and infected with purified, heat- 
treated AV.CMVLacZ at an HOI of 1000 virus 
particles/cell. Induction of ORF6 expression with from 

10 none to increasing concentrations of zinc sulfate was 
initiated 2 hours before the addition of virus and 
continued throughout the duration of the experiment. 

Twenty-four hours after the addition of virus, cells 
were harvested, lysates were generated by sonication and 

15 analyzed for the S-galactosidase expression (i.e., 

fi-galactosidase activity) and virus DNA as described in 
the preceding examples. Hirt extracts were prepared from 
low molecular weight DNA from cell extracts. The 
preparation of the Hirt extracts and subsequent analysis 

20 by Southern hybridization were performed similarly to 
those described in the examples above. 

The results of this experiment were as follows: 
(1) Specific Activity 

The results are illustrated in the bar 

25 graph of Fig. 8A. Specific activity (milliunits 

S-galactosidase/mg protein) is plotted along the vertical 
axis. Below each bar is given the concentration of zinc 
used for induction, the fold-induction relative to 293 
cells, and the fold-induction relative to 293(ORF6) cells 

30 maintained in the absence of zinc. As shown in Fig. 8A, 
in the absence of Zn+2, the 293(MT-ORF6) cell line 
generated 39-fold higher levels of 6-galactosidase in 
rAAV infected 293 cells. Induction of ORF6 expression 
with increasing amounts of Zn +2 resulted in a concomitant 

35 rise in AV.CMVLacZ cell transduction to a level that was 
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445-fold greater than the parent 293 line. Expression of 
El alone was insufficient to have an effect in the 
adenovirus mediated augmentation of rAAV transduction. 

The specific activity of 6-galactosidase 
5 was 196.2 mUnits/mg in E1/E4 expressing 293 cells, 

compared to 1.0 mUnit/mg in 293 cells that only expressed 
El genes. These experiments support a mechanism for 
enhancing rAAV transduction that is dependent on the 
combined expression of both El and E4 adenoviral genes. 
10 (2) Molecular Analysis of the AV.GMVLacZ 

Genome 

The duplex monomer replicative form (RFm) 
was guantitated and the values (CPM) plotted along the, 
vertical axis in the bar graph of Fig. 8B. The 

15 concentration of zinc used for induction and the 

fold- induction relative to 293(ORF6) cells maintained in 
0 mM zinc is given below each bar. 

An autoradiogram (not pictured) shows the 
agarose gel resolved Hirt extracts from the AV.CtWLacZ 

20 transduced cells described above. A plasmid carrying the 
AV.CMVLacZ cDNA was digested to release the entire 
sequence and loaded in a lane of the autoradiogram. The 
band that appeared in this lane therefore reflected the 
migration of a monomer duplex replicative form (RFm) . 

25 The migration of the ss AV.CMVLacZ genome (SS) , RFm, and 
dimers of the duplex replicative form (RFd) were also 
shown. Lanes of the autoradiogram labeled (0>, (50), 
(100), (150), (200), and (250) contained samples from 
293(MT-ORF6) cells that were induced with the indicated 

30 concentration of zinc. A Hirt extract from 293 cells 
(lane labeled 293) transduced with AV.CMVLacZ was also 
shown. 

Analysis of Hirt extracts revealed the 
presence of the RFm in the rAAV infected 293(MT-ORF6) 
35 cells that was not present in similarly infected 293 



WO 96/39530 




PCT/US96/10245 



60 

cells. When the induction profiles (Figs. 8A and 8B) 
that describe AV.CMVLacZ transduction efficiency were 
compared, the results were plotted in Fig. 8C. Specific 
activity (milliunits 5-galactosidase/mg protein) data 
5 from Fig. 8A and counts-per-minute data (CPM) of 
AV.CMVLacZ RFm from Fig. 8B are plotted along the 
vertical axis, and concentration of zinc sulfate used 
during the experiment is shown along the horizontal axis. 

The two profiles are near mirror images. 

10 Importantly, the RFm increased in proportion to the 

increment in lacZ transducing activity that occurred as 
ORF-6 expression was induced with Zn +2 (Fig. 8C) . Similar 
results were obtained with a 293 derived cell line that 
expresses ORF6 from the glucocorticoid responsive MMTV 

15 promoter. 

Example 15 - Enhanced AV.CMVLacZ Transduction in HeLa 
Cells Carrying an Inducible ORF6 Miniaene 

HeLa(MT-ORF6) cells (2xl0 6 ) were transduced at an 
MOI of 1, 000 AV.CMViacZ recombinant, particles/cell in 

20 absence of zinc sulfate inducer or in the presence of 50, 
100, 150, 200, or 250 /xM zinc sulfate inducer in the 
media during transduction. Twenty-four hours later, 
cells were harvested, cell extracts were prepared by 
sonication, and analyzed for transgene expression (i.e., 

25 B-galactosidase activity) . Cell monolayers were 

histochemically stained for fl-galactosidase activity. 

The resulting photomicrographs (not pictured) 
illustrated that histochemical staining revealed an 
increase in the number of cells scored lacZ positive as 

30 the concentration of Zn +2 in the medium was raised from 0 
to 200 mM. Concentrations of 250 mM zinc were found to 
be toxic to the cells. 

Specific activity (milliunits £-galactosidase/mg 
protein) is plotted in Fig. 9 along the vertical axis. 

35 Below each bar is given the concentration of zinc used 
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for induction, the fold- induction relative to HeLa cells, 
and the fold-induction relative to HeLa (Mt-ORF6 ) cells 
maintained in the absence of zinc. Histochemical 
staining revealed an increase in the amount of 
5 fl-galactosidase in lysates as the concentration of Zn +2 
in the medium was raised from 0 to 200 mM. 
Example 16 - Southern Blot Analysis of Low Molecular 
Weight DNAs from AV.CMVLacg Transduced HeLa fMT-0RF6) 
CeUp 

10 following Enaction of E40RF6 

Hirt extracts were prepared from HeLa (MT-0RF6) cells 
transduced with AV.CMVLacZ as described in Example 15 in 
the presence of increasing concentrations of Zn +2 to 
determine whether synthesis of duplex intermediates 

15 contributed to the augmentation in AV.CMVLacZ 
transduction. 

Samples of HeLa (MT-0RF6 ) cells that were induced 
with a concentration of zinc sulfate (0, 50, 100, 150, 
200, and 250) were resolved on a 1.2% agarose Southern 

20 gel (not pictured) , transferred to a nylon membrane, and 
. hybridized with a LacZ-specif ic probe. One lane 

contained a plasmid encoding AV.CMVLacZ that was digested 
to release the entire. genome. Bands corresponding to the 
ss AV.CMVLacZ genome (SS) , duplex monomers (RFm) , and 

25 duplex dimers (RFd) were indicated on the gel. 

Southern analysis indicated that Hela and uninduced 
Hela (MT-0RF6 ) cells demonstrated a single band on 
Southern blots which comigrated with the ss genome. 
Induction of ORF-6 resulted in the appearance of 

30 detectable levels of ds monomer but only at higher 

concentrations of Zn+2. A band comigrating with the RFd 
was present in all cell preparations, the relevance of 
which is unclear since the monomer is a likely precursor 
to the dimer. 



WO 96/39530 




PCT/US96/10245 



62 

Example 17 - Effect of Adenovirus Infection on In Vivo 
AV.CMVLacZ Targeting Efficiency To Murine Liver 

The impact of adenoviral gene expression on rAAV 
transduction in murine liver was studied by sequentially 
5 infusing into the portal vein early gene mutants of 
adenovirus followed by rAAV. 

Balb/c mice, 4- to 6 -weeks old [Jackson 
Laboratories, Bar Harbor, Maine] were anesthetized by an 
intraperitoneal injection of ketamine (70 mg/kg) and 

10 xylazine (10 mg/kg). For liver studies, a 1 cm left 
flank incision was made and the spleen exposed* 

Samples of purified, heat-treated AV.CMVLacZ in 50 
Ml HBS (IxlO 11 virus particles) were used alone or spiked 
with helper adenovirus containing 2x1 0 10 A^ particles of 

15 purified dl!004, H5.CBALP, or ts!25 in a final volume of 
50 The dose of adenovirus was sufficient to 

transduce >25% of hepatocytes. The virus mixture was 
injected just beneath the splenic capsule and the abdomen 
was closed with 3-0 vicryl. 

20 Necropsies were performed 3 days post-infusion and 

tissue frozen in O.C.T. embedding compound. Frozen 
sections (6/xm) (LacZ+ALP) were prepared and 
histochemically stained for B-galactosidase enzyme and 
alkaline phosphatase activity. Sections were 

25 counterstained with neutral red and mounted. 

A B-galactosidase positive hepatocyte targeted with 
AV.GMVLacZ at magnification 2 OX was obtained. 
Histochemical analyses of liver tissue harvested 3 days 
after gene transfer demonstrated that administration of 

30 10 u particles of purified rAV.CMVLacZ alone into the 
portal vein was not associated with appreciable gene 
transfer (<0.01% of cells), confirming the inherent 
inefficiency of the rAAV system. 

Pre infusion with £4 deleted virus had no impact on 

35 rAAV transduction in mouse liver, whereas El deleted 
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virus demonstrated a modest increment in lacZ positive 
hepatocytes to about 0*1%. The most significant increase 
in rAAV transduction occurred following infusion of the 
E2a adenovirus mutant tsl25 with lacZ expression detected 
5 in 10-25% of hepatocytes. A direct relationship between 
adenovirus gene expression and rAAV transduction was 
demonstrated in animals infused with both lacZ rAAV and 
the ALP expressing El deleted virus. The dose of 
adenovirus was reduced 10-fold to minimize the 
10 coincidental occurrence of coinfection. Histochemical 
studies demonstrated co-localization of ALP and 6- 
galactosidase in the majority of 6-galactosidase 
expressing hepatocytes. 

Example 18 - Effect of Adenovirus Infection on In Vivo 

15 AV.CMVLacZ Targeting Efficiency To Murine Luna 

Experiments described in Example 17 for mouse liver 
were adapted for the study of rAAV mediated gene transfer 
to mouse lung. For lung experiments, anesthetized Balb/C 
animals were intubated as described in DeMatteo et al, 

20 Transpla ntation (Baltimoref , 59(5) :787-789 (1995). 

Briefly, a midline 2 cm skin incision was made in the 
neck to expose the trachea. A 2 inch 18 gauge 
angiocatheter was passed through the mouth, positioned in 
the midportion of the trachea, and connected to a rodent 

25 ventilator (#55-3438 Harvard). Polyethylene (PE#10, 

Intramedic) was fed through the catheter via a side port 
and advanced beyond the tracheal bifurcation. Using a 
Hamilton syringe, virus samples (30^1) were slowly 
infused into the lung through the polyethylene tubing. 

30 Samples contained the same formulation of purified, heat- 
treated AV.CMVLacZ with or without helper adenovirus, as 
described for liver injections. 

Tissue was harvested 72 hours post-infusion. Frozen 
sections were histochemically stained for 6-galactosidase 

35 activity and counter stained with neutral red. 
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Frozen sections from lung (AV.OfVLacZ) showed a B- 
galactosidase positive airway epithelial cell targeted 
with KV.CtWLacZ. Similar studies were performed in the 
murine model of lung-directed gene transfer. 
5 Adenoviruses were instilled into the trachea prior to the 
instillation of rAAV. Analysis of lung tissue 3 days 
later revealed only a rare fi-galactosidase positive cell 
in animals instilled with rAAV alone. No detectable 
enhancement of rAAV transduction was noted in animals 

10 preinstilled with adenovirus deleted of either El or E4. 
Substantial enhancement of transduction was achieved in 
conducting airway and alveolar cells of animals 
administered the E2a mutant adenovirus. 

These experiments in murine models of gene therapy 

15 directed to liver and lung verified that the efficiency 
of rAAV transduction is low due limited conversion of the 
input ss genome to a transcriptionally active ds 
intermediate, and that this conversion is facilitated by 
expression of adenovirus El and E4 gene products. 

20~ Example T9~Secbnd GeireraH on rAAV with Regulated " 
Minigene Capable of Enhanci ng Transduction 

The experiments described in previous examples 
illustrated the following principles: 1) purified rAAV is 
a relatively inefficient gene transfer vehicle in vitro 

25 and in vivo and 2) the rate limiting step in transduction 
is not viral entry but rather conversion of the virion" s 
ss DNA genome to a transcriptionally active ds DNA 
genome. Adenovirus can substantially enhance 
transduction through expression of a subset of its genes. 

30 It does this by promoting conversion of the virion's 

genome to its ds form. One approach to accomplish this 
is to incorporate into the recombinant AAV genome a 
minigene that expresses the minimal adenoviral genes 
necessary to enhance transduction , i.e., the 0RF6 region 

35 of E4. 
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Two approaches have been considered in designing 
this modified rAAV. The first strategy is based on a 
rAAV genome that has two transcriptional units in series, 
one expressing the therapeutic gene and the other 
5 expressing its E4 0RF6 from a constitutive promoter. 
While this may, in fact, be useful in many situations, 
constitutive expression of 0RF6 may be detrimental to the 
cell and potentially could elicit a destructive immune 
response. 

10 The second version of this rAAV includes the 

therapeutic minigene in addition to the ORF6 
transcriptional unit which, in this case, is expressed 
from an inducible promoter. When this second gene rAAV 
is administered to the cells (ex vivo strategies) or to 

15 the patient (in vivo strategies) , the inducing agents are 
administered at the time of gene transfer or soon 
thereafter. If the ds genomic form or its integrated 
derivative is stable, the induction of 0RF6 will only be 
necessary at the time of gene transfer into the recipient 

20 cell. Following this, its inducing agent will be 
withdrawn and the 0RF6 gene will be turned off. 

An rAAV that illustrates this concept of inducible 
ORF6 has been constructed and tested in vitro. A 
schematic of the vector pAV.CMVALP.GRE-ORF6, is shown in 

25 Fig. 11 and its sequence is illustrated in SEQ ID NO: 5. 
This second generation construct contains flanking 5 V and 
3 9 AAV ITR sequences. The human placental alkaline 
phosphatase cDNA (ALP) is included in a minigene in which 
the promoter from the immediate early gene of 

30 cytomegalovirus drives the transcription. A second 

transcriptional unit is cloned between the ITRs in series 
and in direct orientation with the alkaline phosphatase 
minigene. The second transcriptional unit expresses the 
Ad5-E4-0RF6 from a glucocorticoid dependent promoter 

35 (GRE) with an SV40 polyadenylation signal. This is 
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called a second generation rAAV construct. 

Specifically, pAV.CMVALP.GRE-0RF6 [SEQ ID NO: 5] 
generates a novel rAAV containing the LacZ transgene and 
the Ad E4 ORF 6 which facilitates ss to ds conversion of 
5 rAAV* The plasmid includes a flanking AAV 5 • ITR 

sequence (nucs. 53-219) ; CMV enhancer/promoter (nucs. 
255-848) ; human placenta alkaline phosphatase cDNA (ALP) 
(nucs. 914-2892); SV40 polyA (nucs. 2893-3090); GRE 
promoter (nucs. 3114-3393); Ad5 E4-0RF6 cDNA (nucs. 3402- 
10 4286); SV40 polyA (nucs. 4315-4512); and 3 1 AAV ITR 

(nucs. 4547-4713). All other nucleotides are plasmid 
derived. 

The second generation rAAV construct was used to 
produce and purify rAAV virions which were exposed to 

15 HeLa cells that were left untreated or incubated with 
dexamethasone. In the absence of dexamethasone, (a 
condition under which little 0RF6 should be expressed), 
little transduction was observed as measured by 
expression of the alkaline phosphatase gene. Cells 

20 incubated in dexamethasone expressed in ORF 6 gene and the 
transduction efficacy was enhanced at least 5-fold. This 
provides evidence to support that a gene product 
expressed from the rAAV can function in cis to enhance 
expression of the transgene. 

25 Example 20 - Application to Bone Marrow Directed Gene 
Theyapy 

Bone marrow directed gene therapy represents the 
paradigm of ex vivo gene therapy where the target cell is 
the hematopoietic stem cell. The basic strategy is to 

30 incorporate (i.e., integrate) a therapeutic minigene into 
the chromosomal DNA of hematopoietic stem cells which are 
transplanted into a recipient patient whose own bone 
marrow has been ablated allowing repopulation of its 
lymphohematopoietic system with progeny of the 

35 genetically corrected stem cell. 
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The problem with this approach has been efficiently 
transfecting genes into stem cells. Host studies of bone 
marrow directed gene therapy have utilized recombinant 
retroviruses which have not been very efficient. One 
5 problem is that retroviruses integrate their provirus 
only when the target cell is dividing. Unf ortunately, 
most stem cells in vitro are quiescent and not dividing. 
rAAV holds the promise of integrating the provirus more 
efficiently into non-dividing stem cells. However , 

10 purified rAAV is not very efficient with respect to 
integration when used alone. In cultured cells, 
integration is observed in less than 1% of the cells. 
The same conditions that activate the conversion of ss to 
ds genome also enhance the integration of the ds 

15 intermediate into the chromosomal DNA. 

Therefore, a desirable application of the methods 
and compositions of this invention is in bone marrow 
directed gene therapy. According to this method, stem 
cells are genetically modified with rAAV and an inducing 

20 agent ex vivo using the constructs and methods described 
above (see e.g., Example 19). Genetically modified stem 
cells are subsequently transplanted by conventional 
techniques • 

Numerous modifications and variations of the present 
25 invention are included in the above- identified 

specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to 
the compositions and processes of the present invention, 
such as selections of different transgenes and plasmids 
30 for the construction of the packaging cell lines and 
rAds, or selection or dosage of the viruses or immune 
modulators, are believed to be encompassed in the scope 
of the claims appended hereto. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1521.. 2405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CTGCATGTGT 


CAGAGGTTTT 


CACCGTCATC 


ACCGAAACGC 


GCGAGGCAGC 


50 


AAGCTTGGCA 


GAAATGGTTG 


AACTCCCGAG 


AGTGTCCTAC 


ACCTAGGGGA 


100 


GAAGCAGCCA 


AGGGGTTGTT 


TCCCACCAAG 


GACGACCCGT 


CTGCGCACAA 


150 


ACGGATGAGC 


CCATCAGACA 


AAGACATATT 


CATTCTCTGC 


TGCAAACTTG 


200 


GCATAGCTCT 


GCTTTGCCTG 


GGGCTATTGG 


GGGAAGTTGC 


GGTTCGTGCT 


250 


CGCAGGGCTC 


TCACCCTTGA 


CTCTTTCAAT 


AATAACTCTT 


CTGTGCAAGA 


300 


TTACAATCTA 


AACAATTCGG 


AGAACTCGAC 


CTTCCTCCTG 


AGGCAAGGAC 


350 


CACAGCCAAC 


TTCCTCTTAC 


AAGCCGCATC 


GATTTTGTCC 


TTCAGAAATA 


400 


GAAATAAGAA 


TGCTTGCTAA 


AAATTATATT 


TTTACCAATA 


AGACCAATCC 


450 


AATAGGTAGA 


TTATTAGTTA 


ctAtgttAag 


AAATGAATCA 


TTATCTTTTA 


500 


GTACTATTTT 


TACTCAAATT 


CAGAAGTTAG 


AAATGGGAAT 


AGAAAATAGA 


550 


AAGAGACGCT 


CAACCTCAAT 


TGAAGAACAG 


GTGCAAGGAC 


TATTGACCAC 


600 


AGGCCTAGAA 


GTAAAAAAGG 


GAAAAAAGAG 


TGTTTTTGTC 


AAAATAGGAG 


650 


ACAGGTGGTG 


GCAACCAGGG 


ACTTATAGGG 


GACCTTACAT 


CTACAGACCA 


700 


ACAGATGCCC 


CCTTACCATA 


TACAGGAAGA 


TATGACTTAA 


ATTGGGATAG 


750 


GTGGGTTACA 


GTCAATGGCT 


ATAAAGTGTT 


ATATAGATCC 


CTCCCCTTTC 


800 


GTGAAAGACT 


CGCCAGAGCT 


AGACCTCCTT 


GGTGTATGTT 


GTCTCAAGAA 


850 


AAGAAAGACG 


ACATGAAACA 


ACAGGTACAT 


GATTATATTT 


ATCTAGGAAC 


900 
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AGGAATGCAC 


TTTTGGGGAA 


AGATTTTCCA 


CTGGACTAAT 


AGAACATTAT 


TCTGCAAAAA 


6ATTA6CCTT 


GATTTGCCCA 


ACCTTGCGGT 


TTTGGTTACA 


AACTGTTCTT 


AAAACAAGGA 


GACTTGGTTT 


GGTATCAAAG 


GTTCTGATCT 


TCCTATGTTC 


TTTTGGAATT 


TATCCAAATC 


ACCAAGATAT 


AAAAGAGTGC 


TGATTTTTTG 


AACATTCACC 


TCTTGTGTGT 


TTGTGTCTGT 


GTCACTTATC 


CTTCACTTTC 


CAGAGGGTCC 


CTCAGGTCGG 


CCGACTGCGG 


CAGCTGGCGC 


AAGTGACCCT 


TGTCTTTATT 


TCTACTATTT 


TCTATCTTGT 


CTGGCTATCA 


TCACAAGAGC 


AACCAAGCTA 


GCCCAATTCG 


ATGACTACGT 


ACACTACGAC 


CAACACGATC 


TCGGTTGTCT 


GGATCGTCTA 


CCTCCTTTTG 


AGACAGAAAC 


ATCATCCGCT 


GCTGCCCGAA 


TGTAACACTT 


TACGTGCGAG 


GTCTTCCCTG 


CAGTGTGGGA 


GGTTGTTCCC 


TGGGATATGG 


TTCTAACGCG 


GGAAGTGTAT 


GCACGTGTGC 


CTGTGTTGTG 


AGCATGATGA 


TCCATGGTTA 


CGAGTCCTGG 


CAGTCCCGGT 


TCCCTGCAGT 


GTATAGCCGG 


GGTTTAGGAT 


GGTGGTGGAT 


GGCGCCATGT 


TACCGGGAGG 


TGGTGAATTA 


CAACATGCCA 


CAGCGTGTTT 


ATGAGGGGTC 


GCCACTTAAT 


ATGGCCACGT 


GGGTTCTGTG 


GTCCCCGCCA 


TTGCACTGTG 


GGATTTTGAA 


CAATATTGTG 
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TACCAAGGAG 


GGGACAGTGG 


950 


CTTATGGCAT 


GAGTTATTAT 


1000 


TCCCAAGGCT 


TAAGTAAGTT 


1050 


TGTGAGACAA 


GTGGTTTCCT 


1100 


GAGCTCTGAG 


TGTTCTATTT 


1150 


TTATGTAAAT 


GCTTATGTAA 


1200 


AGTAAACTTG 


CAACAGTCCT 


1250 


TCGCCATCCC 


GTCTCCGCTC 


1300 


CCCCGCAGAC 


CCCGGCGACC 


1350 


CCGAACAGGG 


ACCCTCGGAT 


1400 


TGTGTTCGTC 


TTGTTTTGTC 


1450 


GGAACGGACT 


CACCTCAGGG 


1500 


CCGGCGTTCC 


ATTTGGCATG 


1550 


CGGCGCACTC 


CGTACAGTAG 


1600 


CCGCGCTACC 


ATACTGGAGG 


1650 


TGACAATGCA 


CAACGTGAGT 


1700 


TTTACGCTGA 


TTCAGGAATG 


1750 


GGAGGAGCTT 


GTAATCCTGA 


1800 


CCAACATTGA 


TATCATGACG 


1850 


GCTCTCCACT 


GTCATTGTTC 


1900 


CGGGCAGGTT 


TTGGCCAGCT 


1950 


TTAATCAGAG 


GTTTATATGG 


2000 


AAAGAGGTAA 


TGTTTATGTC 


2050 


CTACCTGCGC 


TTGTGGTATG 


2100 


TGAGCTTTGG 


ATACAGCGCC 


2150 


GTGCTGTGCT 


GCAGTTACTG 


2200 
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TGCTGATTTA 


AGTGAGATCA 


GGGTGCGCTG 


GCCTTATGCT 


GCGGGCGGTG 


CGAATCATCG 


TTGTATTCCT 


GCAGGACGGA 


GCGGCGGCGG 


GCTGCAGCAC 


CACCGCCCTA 


TCCTGATGCA 


TGTAGGGATC 


CAAGCTTGCG 


GGCGCATCGA 


TGCAGGTCGA 


CTCTAGAGGA 


TCCCGGGTGG 


AGTGCCTCTC 


CTGGCCCTGG 


AAGTTGGCAC 


TCCTAATAAA 


ATTAAGTTGN 


ATCATTTTGT 


AATATTATGG 


GGTGGAGGGG 


GGTGGTATGG 


GACAANCTGT 


AGGGCCTGCG 


GGGTCTATTG 


GGCACAATCT 


TGGCTCACTG 


CAATCTCCGC 


TCCTGCCTCA 


GACTCCCGAG 


TTGTTGGGAT 


CTCAGATAAT 


TTTTGTTTTT 


TTGGTAGAGA 


CAGGCTGGTC 


TCCAACTCCT 


AATCTCAGGT 


CCAAATTGCT 


GGGATTACAG 


GNGTGAACCA 


CTGATTTTAA 


AATAACTATA 


CCAGCAGGAG 


GCTACCTGGC 


CATGCCCAAC 


CGGTGGGACA 


CTGTCCTCTC 


ATGCGTTGGG 


TCCACTCAGT 


ACGCGGCCAG 


CTTGGCTGTG 


GAATGTGTGT 


CCCAGGCTCC 


CCAGCAGGCA 


GAAGTATGCA 


CAGCAACCAG 


GTGTGGAAAG 


TCCCCAGGCT 


CAAAGCATGC 


ATCTCAATTA 


GTCAGNAACC 


GTCCATCCCG 


GCCCTAACTC 


NGGCCAGTTC 


GNTGAGTAAT 


TTGCNNGATT 


TATGCAGNGG 


GAGNTNTTCC 


AGAAGTAGTG 


AGGAGGCTTT 


TGGGATCTGA 


TCAAGAGACA 


GGATGAGGAT 
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CTGTGCCCGG 

X* A X* X X* X* X^X^ X* 


AGGACAAGGC 


2250 


CTGAGGAGAC 


CACTGCCATG 

v**»x* x wwwnA x? 


2300 


C AG C AGTTTA 


TTCGCGCGCT 


2350 


CGATTATGAC 


TCTACCCCCA 

X V* A«%x»x^X^ WV^aX 


2400 


TGATATCAAG 


CTTGCATGCC 


2450 


NATCCCTGTG 


ACCCCTCCCC 


2500 


TCCAGTGCCC 


ACCAGCCTTG 


2550 


CTGACTAGGT 

X« A X***X^ ^ 1W Xi# A 


GTCCTTCTAT 


2600 


AGCAANGGGN 


AANTTGGNAA 


2650 


GG AACAAGCT 


GGAGTGCAGT 

V7V7ZXX7 X X^JTXXJ X 


2700 


OTCCPGGGTT 

W X WW J. W WW X J. 


CAAGCGATTC 

vlulw W w(i X X W 


2750 


TCCAGGCATG 


CATGACCAGG 
wxxx wjtxw wnww 


2800 

x7 V V 


CGGGGTTTCA 


CCATATTGGN 

WWXLXXIX X V7x7&^ 


2850 


wAX wXXV wwwA 


ww X X ww ww X w 


A 7 w x# 


w X Ull X www X X 


WWWX wX WWX X 


2950 


GACGTCCAGA 
wnwwx vwiun 


CACAGCATAG 

WlWlw W* X **x* 


3000 


TTTG AGTTG C 

xxx wn\j x x vj w 


TTG CTTGG CA 

X X WWX X ww wtT 


3050 


AGATGCCTGT 


TGAATTGGGT 


3100 


CAGTTAGGGT 


GTGGAAAGTC 

x* j> x* w*umaw a x^ 


3150 


AAGCATGCAT 

nnw vn a x^ wn A 


CT CAATT AGT 


3200 


CCCCAGCAGG 


CAGAAGTATG 

wnwnnw x«*x 


3250 


ATAGNCCCGC 


CCCTAACTCC 


3300 


CGACCNTNCT 


CCGGCNNATG 


3350 


GCGAGGNCGC 


CTCGGGCTCT 


3400 


NNTGGTGGAA 


TTGATCAGCT 


3450 


CGNNNCGNAT 


GATTGAACAA 


3500 
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GATGGGTTGC ACGGAGGTTC TCCGGNCGCT TGGGTGGGGA GGNTATTCGG 3550 

NTATTNTTGG TGNACAACAG NNAAACGGNT GTTCTGATGC CGCCGCGTTC 3600 

NCGCTTTCAG NGCAGGGGGG CCCCCCTTCT NTTGAGANNA GCNCCCCTTN 3650 

TTG 3653 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Thr Ser Gly Val Pro Phe Gly Met Thr Leu Arg Pro Thr Arg 
15 10 15 

Ser Arg Leu Ser Arg Arg Thr Pro Tyr Ser Arg Asp Arg Leu Pro Pro 
20 25 30 

Phe Glu Thr Glu Thr Arg Ala Thr lie Leu Glu Asp His Pro Leu Leu 
35 40 45 

Pro Glu Cys Asn Thr Leu Thr Met His Asn Val Ser Tyr Val Arg Gly 
50 55 60 

Leu Pro Cys Ser Val Gly Phe Thr Leu lie Gin Glu Trp Val Val Pro 
65 70 75 80 

Trp Asp Met Val Leu Thr Arg Glu Glu Leu Val lie Leu Arg Lys Cys 

85 90 95 

Met His Val Cys Leu Cys Cys Ala Asn lie Asp lie Met Thr Ser Met 
100 105 110 

Met lie Tyr Gly Tyr Glu Ser Trp Ala Leu His Cys His Cys Ser Ser 
115 120 125 

Pro Gly Ser Leu Gin Cys He Ala Gly Gly Gin Val Leu Ala Ser Trp 
130 135 140 

Phe Arg Met Val Val Asp Gly Ala Met Phe Asn Gin Arg Phe He Trp 
145 150 155 160 
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Tyr Arg Glu Val Val Asn Tyr Asn Met Pro Lys Glu Val Met Phe Met 

165 170 175 

Ser Ser Val Phe Met Arg Gly Arg His Leu lie Tyr Leu Arg Tyr Trp 
180 185 190 

Tyr Asp Gly His Val Gly Ser Val Val Pro Ala Met Ser Phe Gly Tyr 
195 200 205 

Ser Ala Leu His Cys Gly lie Leu Asn Asn lie Val Val Leu Cys Cys 
210 215 220 

Ser Tyr Cys Ala Asp Leu Ser Glu lie Arg Val Arg Cys Cys Ala Arg 
225 230 235 240 

Arg Thr Arg Arg Leu Met Leu Arg Ala Val Arg lie lie Ala Glu Glu 

245 250 255 

Thr Thr Ala Met Leu Tyr Ser Cys Arg Thr Glu Arg Arg Arg Gin Gin 
260 265 270 

Phe lie Arg Ala Leu Leu Gin His His Arg Pro lie Leu Met His Asp 
275 280 285 

Tyr Asp Ser Thr Pro Met 
290 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: other nucleic acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 50 

GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 100 

TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 150 

GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 200 

GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 250 

CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 300 
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AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


GGGAGATCAG 


CCTGCAGGTC 


GTTACATAAC 


GGCTGACCGC 


CCAACGACCC 


CCGCCCATTG 


TCCCATAGTA 


ACGCCAATAG 


GGACTTTCCA 


ATTTACGGTA 


AACTGCCCAC 


TTGGCAGTAC 


AGTACGCCCC 


CTATTGACGT 


CAATGACGGT 


TGCCCAGTAC 


ATGACCTTAT 


GGGACTTTCC 


CGAGGCCACG 


TTCTGCTTCA 


CTCTCCCCAT 


CAATTTTGTA 


TTTATTTATT 


TTTTAATTAT 


GGGGGGGGGG 


GGGGGCGCGC 


GCCAGGCGGG 


GGGCGGGGCG 


AGGCGGAGAG 


GTGCGGCGGC 


TCCGAAAGTT 


TCCTTTTATG 


GCGAGGCGGC 


AAGCGAAGCG 


CGCGGCGGGC 


GGGAGCGGGA 


CGCAATTCCC 


GGGGATCGAA 


AGAGCCTGCT 


ATGTCGTTTA 


CTTTGACCAA 


CAAGAACGTG 


AGGCATTGGT 


CTGGACACCA 


GCAAGGAGCT 


TTTTACAACG 


TCGTGACTGG 


GAAAACCCTG 


CTTGCAGCAC 


ATCCCCCTTT 


CGCCAGCTGG 


CACCGATCGC 


CCTTCCCAAC 


AGTTGCGCAG 


TTGCCTGGTT 


TCCGGCACCA 


GAAGCGGTGC 


GATCTTCCTG 


AGGCCGATAC 


TGTCGTCGTC 


CGGTTACGAT 


GCGCCCATCT 


ACACCAACGT 


ATCCGCCGTT 


TGTTCCCACG 


GAGAATCCGA 


TTTAATGTTG 


ATGAAAGCTG 


GCTACAGGAA 


TGATGGCGTT 


AACTCGGCGT 


TTCATCTGTG 


GTTACGGCCA 


GGACAGTCGT 


TTGCCGTCTG 
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TAGCGCGTAA 


TATTTGTCTA 


350 


TTACGGTAAA 


TGGCCCGCCT 


400 


ACGTCAATAA 


TGACGTATGT 


450 


TTGACGTCAA 


TGGGTGGAGT 


500 


ATCAAGTGTA 


TCATATGCCA 


550 


AAATGGCCCG 


CCTGGCATTA 


600 


TACTTGGCAG 


TACATCTACT 


650 


CTCCCCCCCC 


TCCCCACCCC 


700 


TTTGTGCAGC 


GATGGGGGCG 


750 


GCGGGGCGGG 


GCGAGGGGCG 


800 


AGCCAATCAG 


AGCGGCGCGC 


850 


GGCGGCGGCG 


GCCCTATAAA 


900 


TCAGCCACCG 


CGGTGGCGGC 


950 


AAAGCAAAAA 


AGAAGTCACC 


1000 


ATTTTCGTTG 


CCGGTCTGGG 


1050 


GCTCAAGCGC 


GATCCCGTCG 


1100 


GCGTTACCCA 


ACTTAATCGC 


1150 


CGTAATAGCG 


AAGAGGCCCG 


1200 


CCTGAATGGC 


GAATGGCGCT 


1250 


CGGAAAGCTG 


GCTGGAGTGC 


1300 


CCCTCAAACT 


GGCAGATGCA 


1350 


AACCTATCCC 


ATTACGGTCA 


1400 


CGGGTTGTTA 


CTCGCTCACA 


1450 


GGCCAGACGC 


GAATTATTTT 


1500 


GTGCAACGGG 


CGCTGGGTCG 


1550 


AATTTGACCT 


GAGCGCATTT 


1600 
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TTACGCGCCG 


GAGAAAACCG 


CCTCGCGGTG 


ATGGTGCTGC 


GTTGGAGTGA 


1650 


0m, 0m 0m 0m. *i ^vmtvi^ m 

CGGCAGTTAT 


CTGGAAGATC 


AGGATATGTG 


GCGGATGAGC 


GGCATTTTCC 


1700 


0m rmm 0m m\ 0m 0m m 0m>t V\ 

GTGACGTCTC 


GTTGCTGCAT 


^ ^ *m\ ^m A+m a^m m 

AAACCGACTA 


CACAAATCAG 


CGATTTCCAT 


1750 


GTTGCCACTC 


GCTTTAATGA 


TGATTTCAGC 


CGCGCTGTAC 


TGGAGGCTGA 


1800 


AGTTCAGATG 


TGCGGCGAGT 


TGCGTGACTA 


CCTACGGGTA 


ACAGTTTCTT 


1850 


TATGGCAGGG 


TGAAACGCAG 


GTCGCCAGCG 


GCACCGCGCC 


TTTCGGCGGT 


1900 


GAAATTATCG 


ATGAGCGTGG 


TGGTTATGCC 


GATCGCGTGA 


CACTACGTCT 


1950 


GAACGTCGAA 


AACCCGAAAC 


TGTGGAGCGC 


CGAAATCCCG 


AATCTCTATC 


2000 


A^mWmmAWm ^« *mmw% m%mm Ammmwm 

GTGCGGTGGT 


TGAACTGCAC 


ACCGCCGACG 


GCACGCTGAT 


TGAAGCAGAA 


2050 


GCCTGCGATG 


TCGGTTTCCG 


CGAGGTGCGG 


ATTGAAAATG 


GTCTGCTGCT 


2100 


GCTGAACGGC 


AAGCCGTTGC 


TGATTCGAGG 


CGTTAACCGT 


CACGAGCATC 


2150 


ATCCTCTGCA 


TGGTCAGGTC 


ATGGATGAGC 


AGACGATGGT 


GCAGGATATC 


2200 


CTGCTGATGA 


AGCAGAACAA 


CTTTAACGCC 


GTGCGCTGTT 


CGCATTATCC 


2250 


GAACCATCCG 


CTGTGGTACA 


CGCTGTGCGA 


CCGCTACGGC 


CTGTATGTGG 


2300 


TGGATGAAGC 


CAATATTGAA 


ACCCACGGCA 


TGGTGCCAAT 


GAATCGTCTG 


2350 


^ A^mA^mAmm ^ mam ^ m-mmAmm 

ACCGATGATC 


CGCGCTGGCT 


ACCGGCGATG 


AGCGAACGCG 


TAACGCGAAT 


2400 


GGTGCAGCGC 


GATCGTAATC 


ACCCGAGTGT 


GATCATCTGG 


TCGCTGGGGA 


2450 


ATGAATCAGG 


Amm,A^m*m\. A9m A*mm Amm Amm Amm Amm nt 

CCACGGCGCT 


AATGACGACG 


CGCTGTATCG 


CTGGATCAAA 


2500 


ww\ 0m*^mm 0m m 0m 0m m\ wmm^m 

TCTGTCGATC 


CTTCCCGCCC 


GGTGCAGTAT 


GAAGGCGGCG 


GAGCCGACAC 


2550 


CACGGCCACC 


GATATTATTT 


&m 0m 0m 0m 0m m\ *tmm*mmm\ 

GCCCGATGTA 


A^m%Awm Am* Amm Amm, Am% Amm aw** ww% Amm 

CGCGCGCGTG 


0m m\ Mia v m\ 0m m\ Mm) 0m 

GATGAAGACC 


2600 


m A mm Amm AffHn Amm A+m A+m, 

AGCCCTTCCC 


GGCTGTGCCG 


AAATGGTCCA 


TCAAAAAATG 


GCTTTCGCTA 


2650 


CCTGGAGAGA 


CGCGCCCGCT 


GATCCTTTGC 


GAATACGCCC 


ACGCGATGGG 


2700 


TAACAGTCTT 


GGCGGTTTCG 


CTAAATACTG 


GCAGGCGTTT 


CGTCAGTATC 


2750 


CCCGTTTACA 


GGGCGGCTTC 


GTCTGGGACT 


GGGTGGATCA 


GTCGCTGATT 


2800 


AAATATGATG 


AAAACGGCAA 


CCCGTGGTCG 


GCTTACGGCG 


GTGATTTTGG 


2850 


CGATACGCCG 


AACGATCGCC 


AGTTCTGTAT 


GAACGGTCTG 


GTCTTTGCCG 


2900 
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ACCGCACGCC 


GCATCCAGCG 


CTGACGGAAG 


CAAAACACCA 


GCAGCAGTTT 


2950 


TTCCAGTTCC 


GTTTATCCGG 


GCAAACCATC 


GAAGTGACCA 


GCGAATACCT 


3000 


GTTCCGTCAT 


AGCGATAACG 


AGCTCCTGCA 


CTGGATGGTG 


GCGCTGGATG 


3050 


GTAAGCCGCT 


GGCAAGCGGT 


GAAGTGCCTC 


TGGATGTCGC 


TCCACAAGGT 


3100 


AAACAGTTGA 


TTGAACTGCC 


TGAACTACCG 


CAGCCGGAGA 


GCGCCGGGCA 


3150 


ACTCTGGCTC 


ACAGTACGCG 


TAGTGCAACC 


GAACGCGACC 


GCATGGTCAG 


3200 


AAGCCGGGCA 


CATCAGCGCC 


TGGCAGCAGT 


GGCGTCTGGC 

WWW W A W A WWW 


GGAAAACCTC 

V7UAAAAVV X W 




AGTGTGACGC 


TCCCCGCCGC 


GTCCCACGCC 


ATCCCGCATC 


TGACCACCAG 




CGAAATGGAT 


TTTTGCATCG 


AGCTGGGTAA 


TAAGCGTTGG 

^k A inw ^^^tf ^k ^k %^ 


CAATTTAACC 


3350 


GCCAGTCAGG 


CTTTCTTTCA 


CAGATGTGGA 


TTGGCGATAA 


AAAACAACTG 


3400 


CTGACGCCGC 


TGCGCGATCA 


GTTCACCCGT 


GCACCGCTGG 


ATAACGACAT 


3450 


TGGCGTAAGT 


GAAGCGACCC 


GCATTGACCC 


TAACGCCTGG 


GTCGAACGCT 


3500 


GGAAGGCGGC 


GGGCCATTAC 


CAGGCCGAAG 


CAG CGTTGTT 


GCAGTGCACG 


3550 


GCAGATACAC 


TTGCTGATGC 


GGTGCTGATT 


ACGACCGCTC 


ACGCGTGGCA 


3600 


GCATCAGGGG 


AAAACCTTAT 


TTATCAGCCG 


GAAAACCTAC 


CGGATTGATG 


3650 


GTAGTGGTCA 


AATGGCGATT 


ACCGTTGATG 


TTGAAGTGGC 


GAGCGATACA 


3700 


CCGCATCCGG 


CGCGGATTGG 


CCTGAACTGC 


CAGCTGGCGC 


AGGTAGCAGA 


3750 


GCGGGTAAAC 


TGGCTCGGAT 


TAGGGCCGCA 


AGAAAACTAT 


CCCGACCGCC 


3800 


TTACTGCCGC 


CTGTTTTGAC 


CGCTGGGATC 


TGCCATTGTC 


AGACATGTAT 


3850 


ACCCCGTACG 


TCTTCCCGAG 


CGAAAACGGT 


CTGCGCTGCG 


GGACGCGCGA 


3900 


ATTGAATTAT 


GGCCCACACC 


AGTGGCGCGG 


CGACTTCCAG 


TTCAACATCA 


3950 


Is w wl? w 1 AlJAw 


TCAACAGCAA 


CTGATGGAAA 


CCAGCCATCG 


CCATCTGCTG 


4000 


CACGCGGAAG 


AAGGCACATG 


GCTGAATATC 


GACGGTTTCC 


ATATGGGGAT 


4050 


TGGTGGCGAC 


GACTCCTGGA 


GCCCGTCAGT 


ATCGGCGGAA 


TTACAGCTGA 


4100 


GCGCCGGTCG 


CTACCATTAC 


CAGTTGGTCT 


GGTGTCAAAA 


ATAATAATAA 


4150 


CCGGGCAGGC 


CATGTCTGCC 


CGTATTTCGC 


GTAAGGAAAT 


CCATTATGTA 


4200 
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CTATTTAAAA 


AACACAAACT 


TTTGGATGTT 


ACTriTTTAT 


CATGGGAGCC 


TACTTCCCGT 


GACATCAACC 


ATATCAGCAA 


AAGTGATACG 


TTCTCTGTTC 


TCGCTATTAT 


TCCAACCGCT 


AACTCGGCCT 


CGACTCTAGG 


CGGCCGCGGG 


CATTGATGAG 


TTTGGACAAA 


CCACAACTAG 


TTATTTGTGA 


AATTTGTGAT 


GCTATTGCTT 


TGCAATAAAC 


AAGTTAACAA 


CAACAATTGC 


TCAGGGGGAG 


GTGTGGGAGG 


TTTTTTCGGA 


GGCTGATCAG 


TGGAAGGTGC 


TGAGGTACGA 


GACCCTGCGA 


GTGTGGCGGT 


AAACATATTA 


GATGTGACCG 


AGGAGCTGAG 


GCCCGATCAC 


CGCTGAGTTT 


GGCTCTAGCG 


ATGAAGATAC 


GTGGGCGTGG 


CTTAAGGGTG 


GGAAAGAATA 


TAGTTTTGTA 


TCTGTTTTGC 


AGCAGCCGCC 


GTTTGATGGA 


AGCATTGTGA 


GCTCATATTT 


GGGCCGGGGT 


GCGTCAGAAT 


GTGATGGGCT 


GTCCTGCCCG 


CAAACTCTAC 


TACCTTGACC 


GCCGTTGGAG 


ACTGCAGCCT 


CCGCCGCCGC 


CCCGCGGGAT 


TGTGACTGAC 


TTTGCTTTCC 


GCAGCTTCCC 


GTTCATCCGC 


CCGCGATGAC 


ACAATTGGAT 


TCTTTGACCC 


GGGAACTTAA 


TGGATCTGCG 


CCAGCAGGTT 


TCTGCCCTGA 


GCGGTTTAAA 


ACATAAATAA 


AAAACCAGAC 


GCAAGTGTCT 


TGCTGTCTTT 


ATTTAGGGGT 


GGGACCAGCG 


GTCTCGGTCG 


TTGAGGGTCC 
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CGGTTTATTC 


TTTTTCTTTT 


4250 


TTTTCCCGAT 


TTGGCTACAT 


4300 


GGTATTATTT 


TTGCCGCTAT 


4350 


GTTTGGTCTG 


CTTTCTGACA 


4400 


GATCCAGACA 

\JX« JL V* N**»W^ *»%*** 


TGATAAGATA 

A W^*» X ****Vrf** J» ** 


4450 


AATGCAGTGA 


AAAAAATGCT 


4500 


TATTTGTAAC 


CATTATAAGC 


4550 


ATTCATTTTA 


TGTTTCAGGT 


4600 


TCCTCTAGAG 

X WW X W X **wX*w 


TCGACCTGCA 


4650 


TGAGACCGGC 

X wx\wx»W w W w W 


ACCAGGTGCA 

X* W W/l w w X w WA 


4700 

T / WW 


GGAAPPAGCC 


TGTGATGCTG 

X w X wAX w W X w 


4750 


X x Vju X X w 


eeTGfACCCG 

WW X w wxlwwww 


"SOW 


AGAVTGAGGT 

nUAl XwA\7wX 


AOTGAAATGT 

*>W X VJiUlAAV X 


4850 


TATAAGGTGG 


GGGTCTTATG 

Www X \* X X**X 


4900 


GCCGCCATGA 


GCACCAACTC 


4950 


GACAACGCGC 


ATGCCCCCAT 


5000 


CCAGCATTGA 


TGGTCGCCCC 


5050 


TACGAGACCG 

X X* WW** wX* W W w 


TGTCTGGAAC 


5100 


TTCAGCCGCT 

X X Wf*w W www X 


GCAGCCACCG 

\3 wn v» w w** w w w^ 


5150 


TGAGCCCGCT 

X NJAw W W WWW X 


TGCAAGCAGT 


5200 


AAGTTGACGG 


CTCTTTTGGC 


5250 


TGTCGTTTCT 


CAGCAGCTGT 


5300 


AGGCTTCCTC 


CCCTCCCAAT 


5350 


TCTGTTTGGA 


TTTGGATCAA 


5400 


TTTGCGCGCG 


CGGTAGGCCC 


5450 


TGTGTATTTT 


TTCCAGGACG 


5500 
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T6GTAAA6GT 


GACTCTGGAT 


GTTCAGATAC 


6G66T6GAG6 


TAGCACCACT 


GCAGAGCTTC 


AGATGATCCA 


GTCGTAGCAG 


GAGCGCTGGG 


TTCAGTAGCA 


AGCTGATTGC 


CAGGGGCAGG 


AAAGCGGTTA 


AGCTGGGATG 


GGTGCATACG 


TGGACTGTAT 


TTTTAGGTTG 


GCTATGTTCC 


TTCATGTTGT 


GCAGAACCAC 


CAGCACAGTG 


TTTGTCATGT 


AGCTTAGAAG 


GAAATGCGTG 


TGTGACCTCC 


AAGATTTTCC 


ATGCATTCGT 


CCACGGGCGG 


CGGCCTGGGC 


GAAGATATTT 


GTTGTGTTCC 


AGGATGAGAT 


CGTCATAGGC 


GGAGGGTGCC 


AGACTGCGGT 


ATAATGGTTC 


TTACCCTCAC 


AGATTTGCAT 


TTCCCACGCT 


CATGTCTACC 


TGCGGGGCGA 


TGAAGAAAAC 


TCAGCTGGGA 


AGAAAGCAGG 


TTCCTGAGCA 


GTGGGCCCGT 


AAATCACACC 


TATTACCGGG 


GCTGCAGCTG 


CCGTCATCCC 


TGAGCAGGGG 


CCCTGACTCG 


CATGTTTTCC 


CTGACCAAAT 


CCCAGCGATA 


GCAGTTCTTG 


CAAGGAAGCA 


ACCGTCCGCC 


GTAGGCATGC 


TTTTGAGCGT 


GGTCCCACAG 


CTCGGTCACC 


TGCTCTACGG 


CCTCGTTTCG 


CGGGTTGGGG 


CGGCTTTCGC 


TCGTCCAGAC 


GGGCCAGGGT 


CATGTCTTTC 


CAGCGTAGTC 


TGGGTCACGG 


TGAAGGGGTG 


CCAGGGTGCG 


CTTGAGGCTG 


GTCCTGCTGG 


TCGCCCTGCG 


CGTCGGCCAG 


GTAGCATTTG 
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ATGGG CAT AA 


GCCCGTCTCT 


e e c rt 

5550 


AT6CTGCGG6 


GTGGTGTTGT 


5600 


CGTGGTGCCT 


AAAAATGTCT 


5650 


CCCTTGGTGT 


AAGTGTTTAC 


5700 


TGGGGATATG 


AGATGCATCT 


5750 


CAGCCATATC 


CCTCCGGGGA 


5800 


TATCCGGTGC 


ACTTGGGAAA 


5850 


GAAGAACTTG 


GAGACGCCCT 


5900 


CCATAATGAT 


GGCAATGGGC 


5950 


CTGGGATCAC 


TAACGTCATA 


6000 


CATTTTTACA 


AAGCGCGGGC 


6050 


CATCCGGCCC 


AGGGGCGTAG 


6100 


TTGAGTTCAG 


ATGGGGGGAT 


6150 


GGTTTCCGGG 


GTAGGGGAGA 


6200 


GCTGCGACTT 


ACCGCAGCCG 


6250 


TGCAACTGGT 


AGTTAAGAGA 


6300 


GGCCACTTCG 


TTAAGCATGT 


6350 


CCGCCAGAAG 


GCGCTCGCCG 


6400 


AAGTTTTTCA 


ACGGTTTGAG 


6450 


TTGACCAAGC 


AGTTCCAGGC 


6500 


CATCTCGATC 


CAGCATATCT 


6550 


TGTACGGCAG 


TAGTCGGTGC 


6600 


CACGGGCGCA 


GGGTCCTCGT 


6650 


CGCTCCGGGC 


TGCGCGCTGG 


6700 


TGCTGAAGCG 


CTGCCGGTCT 


6750 


ACCATGGTGT 


CATAGTCCAG 


6800 
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CCCCTCCGCG 


GCGTGGCCCT 


TGGCGCGCAG 


CGCACGAGGG 


GCAGTGCAGA 


CTTTTGAGGG 


AATACCGATT 


CCGGGGAGTA 


GGCATCCGCG 


CTCGCATTCC 


ACGAGCCAGG 


TGAGCTCTGG 


GGTTTCCCCC 


ATGCTTTTTG 


ATGCGTTTCT 


CGGTGTCCAC 


GCTCGGTGAC 


GAAAAGGCTG 


CTTGAGAGGC 


CTGTCCTCGA 


GCGGTGTTCC 


ACTCGGACCA 


CTCTGAGACA 


AAGGCTCGCG 


GCTAAGTGGG 


AGGGGTAGCG 


GTCGTTGTCC 


CAGGGTGTGA 


AGACACATGT 


CGCCCTCTTC 


GTTTGTAGGT 


GTAGGCCACG 


TGACCGGGTG 


AAGGGGGTGG 


GGGCGCGTTC 


GTCCTCACTC 


GAGGGCCAGC 


TGTTGGGGTG 


AGTACTCCCT 


CTGCGCTAAG 


ATTGTCAGTT 


TCCAAAAACG 


TGGCCCGCGG 


TGATGCCTTT 


GAGGGTGGCC 


GACAATCTTT 


TTGTTGTCAA 


GCTTGGTGGC 


TGGACAGCAA 


CTTGGCGATG 


GAGCGCAGGG 


GCGCGCTCCT 


TGGCCGCGAT 


GTTTAGCTGC 


CCGCCATTCG 


GGAAAGACGG 


TGGTGCGCTC 


GCCAACCGCG 


GTTGTGCAGG 


GTGACAAGGT 


CCGCGTAGGC 


GCTCGTTGGT 


CCAGCAGAGG 


GAATGGCGGT 


AGGGGGTCTA 


GCTGCGTCTC 


CGGTAAAGAC 


CCCGGGCAGC 


AGGCGCGCGT 


CCTTGCAAGT 


CTAGCGCCTG 


CTGCCATGCG 


GTATGGGTTG 


AGTGGGGGAC 


CCCATGGCAT 


CGTACATGCC 


GCAAATGTCG 


TAAACGTAGA 
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PTTCPPPTP'TYl 




6850 

VI V/ v/ 




Pr2f2PCPCZlC21 


D7UU 




PGPAGAPGGT 




U>V*G X X UGGGG 


ItAAAAAtUi 


/ uuu 




X XwQATGAGC 


*7 AK A 


X CCG X G ittL 


CG lATAUAbA 


/1UU 


GCGGTCCTCC 


X LbTATAbAA 


•71 CIA 


X UWiubLvJiiu 


UALbAAbbAb 


7^nn 


AL X AGGGGG X 


^WAt X LbL X v# 




GGUA1 CAAGG 


AAbbibAl lb 




X X LU X GAAGG 


GGGGUXAXAA 


/ 


TCTTCCGCAT 


r%r* r^xryf* morn/"* 
CGCTGTC1GC 


/ 400 


CTGAAAAGCG 


GGGATGACTT 


7450 


AGGAGGATTT 


GATATTCACC 


7500 


GCATCCATCT 


GGTCAGAAAA 


T C C A 

7550 


AAACGACCCG 


TAbAbbbUb X 


/ ouu 


XX XGGX XXX X 


G X tbwbA iLb 


/ ODU 




G l~G UiA W V» Wl 


77nn 


G X LbbbLALU 


AbblbWivbv 




CAACGCTGGT 


bbLTACb IbX 


/OUU 


UCibULbLLL X 


X bbbbbAbLA 


/ oOU 


GTCCGGGGGG 


TCTGCGTCCA 


7900 


CGAAGTAGTC 


TATCTTGCAT 


7950 


CGGGCGGCAA 


GCGCGCGCTC 


8000 


GGGGTGGGTG 


AGCGCGGAGG 


8050 


GGGGCTCTCT 


GAGTATTCCA 


8100 
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A6ATATGTAG 


GGTAGCATCT 


TCCACCGCGG 


GTATAGTTCG 


TGCGAGGGAG 


CGAGGAGGTC 


CGGGCTGCTC 


TGCTCGGAAG 


ACTATCTGCC 


GATGATATGG 


TTGGACGCTG 


GAAGACGTTG 


TACCGCGTCA 


CGCACGAAGG 


AGGCGTAGGA 


GCTCGGCGGT 


GACCTGCACG 


TCTAGGGCGC 


ATGATGTCAT 


ACTTATCCTG 


TCCCTTTTTT 


GACAAACTCT 


TCGCGGTCTT 


TCCAGTACTC 


CCTCCGAACG 


GTAAGAGCCT 


AGCATGTAGA 


GCGCAGCATC 


CCTTTTCTAC 


GGGTAGCGCG 


GAGCGAGGTG 


TGGGTGAGCG 


CAAAGGTGTC 


ACTGGTATTT 


GAAGTCAGTG 


TCGTCGCATC 


AAGTCCGTGC 


GCTTTTTGGA 


ACGCGGATTT 


GTTGAAGAGT 


ATCTTTCCCG 


CGCGAGGCAT 


AGGGTCCCGG 


CACCTCGGAA 


CGGTTGTTAA 


ATCTCGTCAA 


AGCCGTTGAT 


GTTGTGGCCC 


GCGCGGGATG 


CCCTTGATGG 


AAGGCAATTT 


GCTCTTCAGG 


GGAGCTGAGC 


CCGTGCTCTG 


TGAGGGTTGG 


AAGCGACGAA 


TGAGCTCCAC 


TTGCAGGTGG 


TCGCGAAAGG 


TCCTAAACTG 


CTGGGGTGAT 


GCAGTAGAAG 


GTAAGCGGGT 


CCAAGGTTCG 


CGGCTAGGTC 


TCGCGCGGCA 


GCCGAACTTC 


ATGACCAGCA 


TGAAGGGCAC 


CCATCCAAGT 


ATAGGTCTCT 


ACATCGTAGG 


CGAGGATGCG 


AGCCGATCGG 


GAAGAACTGG 


GGAGTGGCTA 


TTGATGTGGT 


GAAAGTAGAA 
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ATGCTGGCGC 


GCACGTAATC 


8150 


GGGACCGAGG 


TTGCTACGGG 


8200 


TGAAGATGGC 


ATGTGAGTTG 


8250 


AAGCTGGCGT 


CTGTGAGACC 


8300 


GTCGCGCAGC 


TTGTTGACCA 


8350 


AGTAGTCCAG 


GGTTTCCTTG 


8400 


TTCCACAGCT 


CGCGGTTGAG 


8450 

W "* «^ w 


TTGGATCGGA 


AACCCGTCGG 

tulw w wv a \*WWJ 


8500 

W ^ W V 


ACTGGTTGAC 


GGCCTGGTAG 


8550 


TATGCCTGCG 


CGGCCTTCCG 


www 


CCTGACGATG 


ACTTTGAGGT 


8650 


CGCCCTGCTC 


CCAGAGCAAA 


8700 


GGCAGGGCGA 


AGGTGACATC 


8750 


AAAGTTGCGT 


GTGATGCGGA 


8800 


TTACCTGGGC 


GGCGAGCACG 


8850 


ACAATGTAAA 


GTTCCAAGAA 


8900 


TTTAAGTTCC 


TCGTAGGTGA 


8950 


AAAGGGCCCA 


GTCTGCAAGA 


9000 


AGGTCACGGG 


CCATTAGCAT 


9050 


GCGACCTATG 


GCCATTTTTT 


9100 


CTTGTTCCCA 


GCGGTCCCAT 


9150 


GTCACTAGAG 


GCTCATCTCC 


9200 


GAGCTGCTTC 


CCAAAGGCCC 


9250 


TGACAAAGAG 


ACGCTCGGTG 


9300 


ATCTCCCGCC 


ACCAATTGGA 


9350 


GTCCCTGCGA 


CGGGCCGAAC 


9400 
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ACTCGTGCTG 


GCTTTTGTAA 


AAACGTGCGC 


GGCTGTACAT 


CCTGCACGAG 


GTTGACCTGA 


GAGTGGGAAT 


TTGAGCCCCT 


CGCCTGGCGG 


CTTCGGCTGC 


TTGTCCTTGA 


CCGTCTGGCT 


GATCGGACCA 


CCACGCCGCG 


CGAGCCCAAA 


CGGTCGGAGC 


TTGATGACAA 


CATCGCGCAG 


GGAGCTCCCG 


CGGCGTCAGG 


TCAGGCGGGA 


CATAGACGGG 


TCAGGGCGCG 


GGCTAGATCC 


GGGCTGGTTG 


GTGGCGGCGT 


CGATGGCTTG 


GCGCGACTAC 


GGTACCGCGC 


GGCGGGCGGT 


GATGATGCAT 


CTAAAAGCGG 


TGACGCGGGC 


GGCTCCGGAC 


CCGCCGGGAG 


AGGGGGCAGG 


GGGCAGGAGC 


TGGTGCTGCG 


CGCGTAGGTT 


GGCGGTTGAT 


CTCCTGAATC 


TGGCGCCTCT 


GTGAGCTTGA 


GCCTGAAAGA 


GAGTTCGACA 


GACGGCGGCC 


TGGCGCAAAA 


TCTCCTGCAC 


AGGCGATCTC 


GGCCATGAAC 


TGCTCGATCT 


CGTCCGGCTC 


GCTCCACGGT 


GGCGGCGAGG 


GAGCTGCGAG 


AAGGCGTTGA 


GGCCTCCCTC 


CCACGCCCCC 


TTCGGCATCG 


CGGGCGCGCA 


AGCTCCACGT 


GCCGGGCGAA 


GACGGCGTAG 


GTAGTTGAGG 


GTGGTGGCGG 


TGTGTTCTGC 


AGCGTCGCAA 


CGTGGATTCG 


TTGATATCCC 


ATGGCCTCGT 


AGAAGTCCAC 


GGCGAAGTTG 


CGACACGGTT 


AACTCCTCCT 


CCAGAAGACG 


CGCGCACCTC 


GCGCTCAAAG 


GCTACAGGGG 
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nulnLiubWi 


GUGG 1 bwAtu 




CGACCGCGCA 


^»TA TV TV TV ^ f» 7\ 

GAAGGAAGCA 


QCAA 

youu 


GTTTGGCTGG 


IT10/^m^*l I nil/ till TV 

TGGTCTTCTA 


ncen 


GCTCGAGGGG 


AGTTACGGTG 


y ouu 


GTCCAGATGT 


CCGCGCGCGG 


Q £ C A 

9650 


ATGGGAGCTG 


TCCATGGTCT 


G*7 A A 

y / uu 


GCTCCTGCAG 


GTTTACCTCG 


y /du 


n s**/*ms* tv fmv r*ry 

AGGTGATACC 


TAATTT CCAG 


Q O A A 

y sou 


CAAGAGGCCG 


CATCCCCGCG 


COCA 

yo50 


GGGCCGCGGG 


GGTGTCCTTG 


n n n n 

9900 


GAGCCCCCGG 


AGGTAGGGGG 


n n c n 

9950 


GGCACGTCGG 


CGCCGCGCGC 


10000 


GCTGGCGAAC 


GCGACGACGC 


10050 


GCGTGAAGAC 


GACGGGCCCG 


10100 


GAATCAATTT 


CGGTGTCGTT 


10150 


GTCTCCTGAG 


TTGTCTTGAT 


i no a a 
10200 


CTTCCTCCTG 


GAGATCTCCG 


10250 


TCGTTGGAAA 


TGCGGGCCAT 


1 A*> A A 

10 300 


GTTCCAGACG 


CGGCTGTAGA 




TGACGACCTG 


CGCGAGATTG 


1U4UU 


TTTCGCAGGC 


GCTGAAAGAG 


XU4DU 


CACGAAGAAG 


TACATAACCC 


10500 


CCAAGGCCTC 


AAGGCGCTCC 


10550 


AAAAACTGGG 


AGTTGCGCGC 


10600 


GATGAGCTCG 


GCGACAGTGT 


10650 


CCTCTTCTTC 


TTCTTCAATC 


10700 
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TCCTCTTCCA 


TAAGGGCCTC 


CCCTTCTTCT 


A666666ACA 


CGGCGGCGAC 


GACGGCGCAC 


GCTCGATCAT 


CTCCCCGCGG 


CGACGGCGCA 


CCGTTCTCGC 


GGGGGCGCAG 


TTGGAAGACG 


ATGGGTTGGC 


GGGGGGCTGC 


CATGCGGCAG 


ATCTCAACAA 


TTGTTGTGTA 


GGTACTCCGC 


TCCGCATCGA 


CCGGATCGGA 


AAACCTCTCG 


ACAGTCGCAA 


GGTAGGCTGA 


GCACCGTGGC 


CGGGGTTGTT 


TCTGGCGGAG 


GTGCTGCTGA 


GTCTTGAGAC 


GGCGGATGGT 


CGACAGAAGC 


CTGCTGAATG 


CGCAGGCGGT 


CGGCCATGCC 


GGCGCAGGTC 


TTTGTAGTAG 


TCTTGCATGA 


TCTTCTCCTT 


CCTCTTGTCC 


1TGCATCTCTT 


GGCGGAGTTT 


GGCCGTAGGT 


GGCGCCCTCT 


CGAAGCCCCT 


CATCGGCTGA 


AGCAGGGCTA 


GCTAATATGG 


CCTGCTGCAC 


CTGCGTGAGG 


GTCCACAAAG 


CGGTGGTATG 


CGCCCGTGTT 


CCATAACGGA 


CCAGTTAACG 


GTCTGGTGAC 


TACCTGAGAC 


GCGAGTAAGC 


CCTCGAGTCA 


CCGCACCAGG 


TACTGGTATC 


CCACCAAAAA 


AGAGGGGCCA 


GCGTAGGGTG 


GCCGGGGCTC 


ATAAGGCGAT 


GATATCCGTA 


GATGTACCTG 


GGCGGTGGTG 


GAGGCGCGCG 


GAAAGTCGCG 


GCAGCGGCAA 


AAAGTGCTCC 


ATGGTCGGGA 


GCGCAATCGT 


TGACGCTCTA 


GACCGTGCAA 


CACTCTTCCG 


TGGTCTGGTG 


GATAAATTCG 
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TCTTCTGGCG GCGGTGGGGG 10750 

CGGGAGGCGG TCGACAAAGC 10800 

TGGTCTCGGT GACGGCGCGG 10850 

CCGCCCGTCA TGTCCCGGTT 10900 

GGATACGGCG CTAACGATGC 10950 

CGCCGAGGGA CCTGAGCGAG 11000 

AGAAAGGCGT CTAACCAGTC 11050 

GGGCGGCAGC GGGCGGCGGT 11100 

TGATGTAATT AAAGTAGGCG 11150 

ACCATGTCCT TGGGTCCGGC 11200 

CCAGGCTTCG TTTTGACATC 11250 

GCCTTTCTAC CGGCACTTCT 11300 

GCATCTATCG" CTGCGGCGGC 11350 

TCCTCCCATG CGTGTGACCC 11400 

GGTCGGCGAC AACGCGCTCG 11450 

GTAGACTGGA AGTCATCCAT 11500 

GATGGTGTAA GTGCAGTTGG 11550 

CCGGCTGCGA GAGCTCGGTG 11600 

AATACGTAGT CGTTGCAAGT 11650 

GTGCGGCGGC GGCTGGCGGT 11700 

CGGGGGCGAG ATCTTCCAAC 11750 

GACATCCAGG TGATGCCGGC 11800 

GACGCGGTTC CAGATGTTGC 11850 

CGCTCTGGCC GGTCAGGCGC 11900 

AAGGAGAGCC TGTAAGCGGG 11950 

CAAGGGTATC ATGGCGGACG 12000 
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ACCGGGGTTC 


GAGCCCCGTA 


TCCGGCCGTC 


CCGCCCGCGT 


GTCGAACCCA 


GGTGTGCGAC 


TCCTTTTGGC 


TTCCTTCCAG 


GCGCGGCGGC 


CCACTGGCCG 


CGCGCAGCGT 


AAGCGGTTAG 


AGTGGCTCGC 


TCCCTGTAGC 


CGGAGGGTTA 


GGGACCCCCG 


GTTCGAGTCT 


CGGACCGGCC 


TTGCCTCCCC 


GTCATGCAAG 


ACCCCGCTTG 


GACGAGCCCC 


TTTTTTGCTT 


TTCCCAGATG 


GCGCCCCCCT 


CCTCAGCAGC 


GGCAAGAGCA 


GGGCACCCTC 


CCCTCCTCCT 


ACCGCGTCAG 


GACGCGGCAG 


CAGATGGTGA 


TTACGAACCC 


CTACCTGGAC 


TTGGAGGAGG 


GCGAGGGCCT 


~~CTCCTGAGCG 


GTACCCAAGG 


GTGCAGCTGA 


TACGTGCCGC 


GGCAGAACCT 


GTTTCGCGAC 


GGAGATGCGG 


GATCGAAAGT 


TCCACGCAGG 


TGAATCGCGA 


GCGGTTGCTG 


CGCGAGGAGG 


ACCGGGATTA 


GTCCCGCGCG 


CGCACACGTG 


CGCATACGAG 


CAGACGGTGA 


ACCAGGAGAT 


ACAACCACGT 


GCGTACGCTT 


GTGGCGCGCG 


ATGCATCTGT 


GGGACTTTGT 


AAGCGCGCTG 


GCCGCTCATG 


GCGCAGCTGT 


TCCTTATAGT 


AGGCATTCAG 


GGATGCGCTG 


CTAAACATAG 


CTGCTCGATT 


TGATAAACAT 


CCTGCAGAGC 


CTTGAGCCTG 


GCTGACAAGG 


TGGCCGCCAT 


TGGGCAAGTT 


TTACGCCCGC 


AAGATATACC 


GACAAGGAGG 


TAAAGATCGA 


GGGGTTCTAC 
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tuLUu J. uAi 


UAluLuui Xi* 




fyvc a a & f* a a r* 

\a X WluiiUAAL 


V3\a\3\3\3n\2 X bU 


xz xuu 


lutluLuLlA 


uUi X 1 X X lw 


i oi en 
XZ XDU 


GCTGGAAAGC 


GAAAGCAX XA 


t 1 *> A A 

12200 


f llfllf lift 1 A"^1k » f+ f+ 

TTTTCCAAGG 


GTTGAGTCGC 


12250 


GGACTGCGGC 


GAACGGGGGT 


o o rt 

12300 


CAAATTCCTC 


CGGAAACAGG 


12350 


CATCCGGTGC 


TGCGGCAGAT 


12400 


AGAGCAGCGG 


CAG A C ATG CA 


12450 


6AGGG6CGAC 


ATCCGCGGTT 


12500 


CCGCGGCGCC 


GGGCCCGGCA 


12550 


GGCGCGGCTA 


GGAGCGCCCT 


12600 


AGCGTGATAC 


GCGTGAGGCG 


12650 


CGCGAGGGAG 


AGGAGCCCGA 


1 *T A A 

12700 


GCGCGAGCTG 


CGGCATGGCC 


12750 


ACTTTGAGCC 


CGACGCGCGA 


12800 


GCGGCCGCCG 


ik mi^i^mik ik a> 

ACCTGGTAAC 


*l O A e? A 

12850 


T AACTTT CAA 


H Ik Ik *lk /'IfllffflfTI Ik 

AAAAGCTTTA 


■1 A A A A 

12900 


AGGAGGTGG C 


TATAGGACTG 


4 A A C A 

12950 


GAGCAAAACC 


^» ik ik m% ^**^ik ik 

C AAAT AG CAA 


4 M MAM 

13000 


GGAGCACAGC 


AGGGACAACG 


13050 


TAGAGCCCGA 


GGGCCGCTGG 


13100 


ATAGTGGTGC 


AGGAGCGCAG 


13150 


CAACTATTCC 


ATGCTTAGCC 


13200 


ATACCCCTTA 


CGTTCCCATA 


13250 


ATGCGCATGG 


CGCTGAAGGT 


13300 
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GCTTACCTTG 


AGCGACGACC 


TGGGCGTTTA 


TCGCAACGAG 


CGCATCCACA 


13350 


A66CCGT6AG 


CGTGAGCCGG 


CGGCGCGAGC 


TCAGCGACCG 


CGAGCTGATG 


13400 


CACAGCCTGC 


AAAGGGCCCT 


GGCTGGCACG 


GGCAGCGGCG 


ATAGAGAGGC 


13450 


CGAGTCCTAC 


TTTGACGCGG 


GCGCTGACCT 


GCGCTGGGCC 


CCAAGCCGAC 


13500 


GCGCCCTGGA 


GGCAGCTGGG 


GCCGGACCTG 


GGCTGGCGGT 


GGCACCCGCG 


13550 


CGCGCTGGCA 


ACGTCGGCGG 


CGTGGAGGAA 


TATGACGAGG 


ACGATGAGTA 


13600 


CGAGCCAGAG 


GACGGCGAGT 


ACTAAGCGGT 


GATGTTTCTG 


ATCAGATGAT 


13650 


GCAAGACGCA 


ACGGACCCGG 


CGGTGCGGGC 


GGCGCTGCAG 

***7***7** A ******** 


AGCCAGCCGT 

**V* ************** A 


13700 


CCGGCCTTAA 


CTCCACGGAC 


GACTGGCGCC 


AGGTCATGGA 
**** ** ^ **#• * ** **** 


CCGGATCATG 


13750 


TCGCTGACTG 


CGCGCAATCC 


TGACGCGTTC 
^ * * **** ** ** ^ ^ ** 


CGGCAGCAGC 

****** **X*** ****** ** 


CGCAGGCCAA 

********** V* ** ****** 


13800 


CCGGCTCTCC 


GCAATTCTGG 


AAGCGGTGGT 
*■"»«* ****** « ** ** ^ 


CCCGGCGCGC 
********** **** ****** 


GCAAACCCCA 
** *** ** ** * ** ** ** * 


13850 


CGCACGAGAA 


GGTGCTGGCG 


ATCGTAAACG 


CGCTGGCCGA 


AAACAGGGCC 


13900 


ATCCGGCCCG 


ACGAGGCCGG 


CCTGGTCTAC 
** ** a ** ** a ** a *»** 


GACGCGCTGC^ 
******** **** ** a ** ** 


TT^GCGCGT 

db A ****** **** **> ** A 


13950~ 


GGCTCGTTAC 


AACAGCGGCA 


ACGTGCAGAC 


CAACCTGGAC 


CGGCTGGTGG 


14000 


GGGATGTGCG 


CGAGGCCGTG 


GCGCAGCGTG 


AGCGCGCGCA 


GCAGCAGGGC 


14050 


AACCTGGGCT 


CCATGGTTGC 


ACTAAACGCC 


TTCCTGAGTA 


CACAGCCCGC 


14100 


CAACGTGCCG 


CGGGGAGAGG 


AGGACTACAC 


CAACTTTGTG 


AGCGCACTGC 


14150 


GGCTAATGGT 


GACTGAGACA 


CCGCAAAGTG 


AGGTGTACCA 


GTCTGGGCCA 


14200 


GACTATTTTT 


TCCAGACCAG 


TAGACAAGGC 


CTGCAGACCG 


TAAACCTGAG 


14250 


CCAGGCTTTC 


AAAAACTTGC 


AGGGGCTGTG 


GGGGGTGCGG 


GCTCCCACAG 


14300 


GCGACCGCGC 


GACCGTGTCT 


AGCTTGCTGA 


CGCCCAACTC 


GCGCCTGTTG 


14350 


CTGCTGCTAA 


TAGCGCCCTT 


CACGGACAGT 


GGCAGCGTGT 


CCCGGGACAC 


14400 


ATACCTAGGT 


CACTTGCTGA 


CACTGTACCG 


CGAGGCCATA 


GGTCAGGCGC 


14450 


ATGTGGACGA 


GCATACTTTC 


CAGGAGATTA 


CAAGTGTCAG 


CCGCGCGCTG 


14500 


GGGCAGGAGG 


ACACGGGCAG 


CCTGGAGGCA 


ACCCTAAACT 


ACCTGCTGAC 


14550 


CAACCGGCGG 


CAGAAGATCC 


CCTCGTTGCA 


CAGTTTAAAC 


AGCGAGGAGG 


14600 
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AGCGCATTTT 


GCGCTACGTG 


CAGCAGAGCG 


6AC6GG6TAA 


CGCCCAGCGT 


GGCGCTGGAC 


ACCGGGCATG 


TATGCCTCAA 


ACCGGCCGTT 


ACTTGCATCG 


CGCGGCCGCC 


GTGAACCCCG 


TTGAACCCGC 


ACTGGCTACC 


GCCCCCTGGT 


GGTGCCCGAG 


GGTAACGATG 


GATTCCTCTG 


TGTTTTCCCC 


GCAACCGCAG 


ACCCTGCTAG 


GCAGAGGCGG 


CGCTGCGAAA 


GGAAAGCTTC 


CGATCTAGGC 


GCTGCGGCCC 


CGCGGTCAGA 


GCTTGATAGG 


GTCTCTTACC 


AGCACTCGCA 


GGCGAGGAGG 


AGTACCTAAA 


CAACTCGCTG 


AAACCTGCCT 


CCGGCATTTC 


CCAACAACGG 


AGATG AGTAG 


ATGGAAGACG 


TACGCGCAGG 


CCGCGCCCGC 


CCACCCGTCG 


TCAAAGGCAC 


GTGGGAGGAC 


GATGACTCGG 


CAGACGACAG 


GGAGTGGCAA 


CCCGTTTGCG 


CACCTTCGCC 


TAAAAAAAAA 


AAAGCATGAT 


GCAAAATAAA 


ACCGAGCGTT 


GGTTTTCTTG 


TATTCCCCTT 


GTATGAGGAA 


GGTCCTCCTC 


CCTCCTACGA 


CAGTGGCGGC 


GGCGCTGGGT 


TCTCCCTTCG 


TTTGTGCCTC 


CGCGGTACCT 


GCGGCCTACC 


TTACTCTGAG 


TTGGCACCCC 


TATTCGACAC 


ACAACAAGTC 


AACGGATGTG 


GCATCCCTGA 


AACTTTCTGA 


CCACGGTCAT 


TCAAAACAAT 


AAGCACACAG 


ACCATCAATC 


TTGACGACCG 


TGAAAACCAT 


CCTGCATACC 


AACATGCCAA 
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TV^ Af^PPTTA A 


WW JL \J£\ X W ww w 




ATYIAPPGPGP 


GPAAPATCfiA 


1 A700 


tatpa app^ip 

XriX WUiV«>wuU 


P*PA A*TV!L^£A P'P 
w X An X wwAw X 


In / 3U 


a mr 1 a Tiwpp a p 

AVJlnl X 1 w\rll— 




1 Aonn 


lit- XAUAULli 


p p pp 7k fwrpr * 
bubo AX ILbA 


14ojU 


p c* 7k r*r+ 7k r* 7k *t» 7k 


bALbAuAbLb 


1 il AAA 

X49U0 


ZIPTTPPA AP A 
Aw 1 1 b\«AALn 


PPPPPTiPPlkp 
bCbubnbwib 




wwVJAwwwt-AA 


f 07k ^^^'IPFPf HO 

bUAbLl IblL 


1 CAAA 

13UUU 


1 w U 1 Ala 1 AbL 


007krnrnmf»f*7k 7k 
bUAl 1 1 ULAA 


1 CACA 

1DUDU 




bbbUt 1 bU 1 \3 


i ci no 


w 1 w tJAw L. ww w 


Jk P PP PP 7k 7k 7k 7k 
AbbbtbAAAA 


1 CI CO 


wAlAwAGAGL. 


OfPIk f*fTlf*f* 7k OTk 

CTAGTGGACA 


X5200 


AGCACAGGGA 


CGTGCCAGGC 


«i c? a e a* 

15250 


GACCGTCAGC 


GGGGTCTGGT 


1 A A 

15300 


CAGCGTCCTG 


GATTTGGGAG 


15350 


CCAGGCTGGG 


GAGAATGTTT 


15400 


AAAL 1 WitWi 


7k P P P P 7k TV* O O 

AbbLuAXbbU 


1D4DU 


A nTaTCf^r 


finfinfirtnfz at 

uv#bLbbUunl 




c 7k nTrsTccm 

wrlvj XuX uu 1 w? 


APpr^pnppcp 




A tyip'ppppp'f 


P.P. A HHfflHHfl 


XjOUU 


CCCnnC A C A A 


APAP-PATPPP. 


X*>OjU 


CACCCGTGTG 


TACCTGGTGG 


15700 


ACTACCAGAA 


CGACCACAGC 


15750 


GACTACAGCC 


CGGGGGAGGC 


15800 


GTCGCACTGG 


GGCGGCGACC 


15850 


ATGTGAACGA 


GTTCATGTTT 


15900 
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ACCAATAAGT TTAAGGCGCG GGTGATGGTG TCGCGCTTGC CTACTAAGGA 15950 
CAATCAGGTG GAGCTGAAAT ACGAGTGGGT GGAGTTCACG CTGCCCGAGG 16000 
GCAACTACTC CGAGACCATG ACCATAGACC TTATGAACAA CGCGATCGTG 16050 
GAGCACTACT TGAAAGTGGG CAGACAGAAC GGGGTTCTGG AAAGCGACAT 16100 
CGGGGTAAAG TTTGACACCC GCAACTTCAG ACTGGGGTTT GACCCCGTCA 16150 
CTGGTCTTGT CATGCCTGGG GTATATACAA ACGAAGCCTT CCATCCAGAC 16200 
ATCATTTTGC TGCCAGGATG CGGGGTGGAC TTCACCCACA GCCGCCTGAG 16250 
CAACTTGTTG GGCATCCGCA AGCGGCAACC CTTCCAGGAG GGCTTTAGGA 16300 
TCACCTACGA TGATCTGGAG GGTGGTAACA TTCCCGCACT GTTGGATGTG 16350 
GACGCCTACC AGGCGAGCTT GAAAGATGAC ACCGAACAGG GCGGGGGTGG 16400 
CGCAGGCGGC AGCAACAGCA GTGGCAGCGG CGCGGAAGAG AACTCCAACG 16450 
CGGCAGCCGC GGCAATGCAG CCGGTGGAGG ACATGAACGA TCATGCCATT 16500 
CGCGGCGACA CCTTTGCCAC ACGGGCTGAG~ GAGAAGCGCG CTGAGGCCGA 16550 
AGCAGCGGCC GAAGCTGCCG CCCCCGCTGC GCAACCCGAG GTCGAGAAGC 16600 
CTCAGAAGAA ACCGGTGATC AAACCCCTGA CAGAGGACAG CAAGAAACGC 16650 
AGTTACAACC TAATAAGCAA TGACAGCACC TTCACCCAGT ACCGCAGCTG 16700 
GTACCTTGCA TACAACTACG GCGACCCTCA GACCGGAATC CGCTCATGGA 16750 
CCCTGCTTTG CACTCCTGAC GTAACCTGCG GCTCGGAGCA GGTCTACTGG 16800 
TCGTTGCCAG ACATGATGCA AGACCCCGTG ACCTTCCGCT CCACGCGCCA 16850 
GATCAGCAAC TTTCCGGTGG TGGGCGCCGA GCTGTTGCCC GTGCACTCCA 16900 
AGAGCTTCTA CAACGACCAG GCCGTCTACT CCCAACTCAT CCGCCAGTTT 16950 
ACCTCTCTGA CCCACGTGTT CAATCGCTTT CCCGAGAACC AGATTTTGGC 17000 
GCGCCCGCCA GCCCCCACCA TCACCACCGT CAGTGAAAAC GTTCCTGCTC 17050 
TCACAGATCA CGGGACGCTA CCGCTGCGCA ACAGCATCGG AGGAGTCCAG 17100 
CGAGTGACCA TTACTGACGC CAGACGCCGC ACCTGCCCCT ACGTTTACAA 17150 
GGCCCTGGGC ATAGTCTCGC CGCGCGTCCT ATCGAGCCGC ACTTTTTGAG 17200 
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CAAGCATGTC 


CATCCTTATA 


TCGCCCAGCA 


CGCTTCCCAA 


GCAAGATGTT 


TGGCGGGGCC 


CCCAGTGCGC 


GTGCGCGGGC 


ACTACCGCGC 


GCGGCCGCAC 


TGGGCGCACC 


ACCGTCGATG 


GAGGAGGCGC 


GCAACTACAC 


GCCCACGCCG 


CGCGGCCATT 


CAGACCGTGG 


TGCGCGGAGC 


AGAGACGGCG 


GAGGCGCGTA 


GCACGTCGCC 


GCCGCCCAAC 


GCGCGGCGGC 


GGCCCTGCTT 


CCGACGGGCG 


GCCATGCGGG 


CCGCTCGAAG 


CTGTGCCCCC 


CAGGTCCAGG 


CGACGAGCGG 


ATTAGTGCTA 


TGACTCAGGG 


TCGCAGGGGC 


CTCGGTTAGC 


GGCCTGCGCG 


TGCCCGTGCG 


AGATTGCAAG 


AAAAAACTAC 


TTAGACTCGT 


GCGGCGGCGC 


GCAACGAAGC 


TATGTCCAAG 


GCTCCAGGTC 


ATCGCGCCGG 


AGATCTATGG 


AGGATTACAA 


GCCCCGAAAG 


CTAAAGCGGG 


GATGATGATG 


AACTTGACGA 


CGAGGTGGAA 


CAGGCGACGG 


GTACAGTGGA 


AAGGTCGACG 


CCGGCACCAC 


CGTAGTCTTT 


ACGCCCGGTG 


AAGCGCGTGT 


ATGATGAGGT 


GTACGGCGAC 


CAACGAGCGC 


CTCGGGGAGT 


TTGCCTACGG 


TGGCGTTGCC 


GCTGGACGAG 


GGCAACCCAA 


ACACTGCAGC 


AGGTGCTGCC 


CGCGCTTGCA 


CCTAAAGCGC 


GAGTCTGGTG 


ACTTGGCACC 


CCAAGCGCCA 


GCGACTGGAA 


GATGTCTTGG 


GGGCTGGAGC 


CCGAGGTCCG 


CGTGCGGCCA 
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ATAAGAGAGG 


CTGGGGCCTG 


1 to en 


AAGAAGCGCT 


MMM Jk MM 7k Jk p« 

CCGACCAACA 


1 *7 0 AA 

1 / JUU 


GCCCTGGGGC 


M MM Mj Jk M m » Ik M 

GCGGAGAAAC 


X / JDU 


ACGCCATCGA 


MM MM M PTtM M (TIM 

CGCGGTGGTG 


1 Tit Ar\ 
X74UU 


CCACCAGTGT 


mm ^ m% MmM^iih 

CCACAGTGGA 


"1 *T A tZ A 

17450 


CCGGCGCTAT 


GCTAAAATGA 


1 T C Art 

17500 


* 

ACCGCCGCCG 


ACCCGGCACT 


X7550 


•% *lk M.MM MM f^Ti Mj 

AACCGCGCAC 


GTCGCACCGG 


i *t ^ ^ a 
17600 


GCTGGCCGCG 


GGTATTGTCA 


17650 


CCGCCGCAGC 


AGCCGCGGCC 


17700 


AACGTGTATT 


GGGTGCGCGA 


17750 


CACCCGCCCC 


CCGCGCAACT 


17800 


ACTGTTGTAT 


GTATCCAGCG 


17850 


CGCAAAATCA 


AAGAAGAGAT 


17900 


CCCCCCGAAG 


AAGGAAGAGC 


17950 


TCAAAAAGAA 


AAAGAAAGAT 


18000 


CTGCTGCACG 


GTACCGCGCC 


1 O AC A 

18050 


CGTAAAACGT 


GTTTTGCGAC 


18100 


AGCGCTCCAC 


MMM "» M/ III I* M 

CCGCACCTAC 


18150 


m « mm m AranM m 

GAGGACCTGC 


fT*fTlM » M M^V ^ ^ 

TTGAGGAGGC 


18200 


« v « m mm m mh rn 

AAAGCGGCAT 


» MM 7\ M"» 

AAGGAGATG C 


1 0 O C f\ 

18250 


CACCTAGCCT 


AAAGCCCGTA 


18300 


CCGTCCGAAG 


AAAAGCGCGG 


18350 


CACCGTGCAG 


CTGATGGTAC 


18400 


AAAAAATGAC 


CGTGGAACCT 


18450 


ATCAAGCAGG 


TGGCGCCGGG 


18500 



WO 96/39530 




88 

ACTGGGC6TG CAGACCGTGG ACGTTCAGAT 
GTATTGCCAC CGCCACAGAG GGCATGGAGA 
TCAGCGGTGG CGGATGCCGC GGTGCAGGCG 
GACCTCTACG GAGGTGCAAA CGGACCCGTG 
CCCGGCGCCC GCGCGGTTCG AGGAAGTACG 
CCCGAATATG CCCTACATCC TTCCATTGCG 
CTACACCTAC CGCCCCAGAA GACGAGCAAC 
CTGGAACCCG CCGCCGCCGT CGCCGTCGCC 
TCCGTGCGCA GGGTGGCTCG CGAAGGAGGC 
AGCGCGCTAC CACCCCAGCA TCGTTTAAAA 
CAGATATGGC CCTCACCTGC CGCCTCCGTT 
GGAAGAATGC ACCGTAGGAG GGGCATGGCC 
CATGCGTCGT GCGCACCACC GGCGGCGGCG 
GCGGCGGTAT CCTGCCCCTC CTTATTCCAC 
GCCGTGCCCG GAATTGCATC CGTGGCCTTG 
AAAAACAAGT TGCATGTGGA AAAATCAAAA 
GCTCGCTTGG TCCTGTAACT ATTTTGTAGA 
GTCTCTGGCC CCGCGACACG GCTCGCGCCC 
ATATCGGCAC CAGCAATATG AGCGGTGGCG 
TGGAGCGGCA TTAAAAATTT CGGTTCCACC 
GGCCTGGAAC AGCAGCACAG GCCAGATGCT 
AAAATTTCCA ACAAAAGGTG GTAGATGGCC 
GTGGTGGACC TGGCCAACCA GGCAGTGCAA 
TGATCCCCGC CCTCCCGTAG AGGAGCCTCC 
CTCCAGAGGG GCGTGGCGAA AAGCGTCCGC 
CTGGTGACGC AAATAGACGA GCCTCCCTCG 
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ACCCACTACC 


AGTAGCACCA 

nw x nwvnvwn 


18550 


CACAAACGTC 


WW Www X X W WW 


XO W W 


GTCGCTGCGG 

*J X W W 7 W X w V*\J %J 


CCGCGTPP A A 
w Ww Ww x wwin 


xo o*s V 


G ATGTTT PGP 

wXlXwX X X WwW 


GTTTPAGPPP 

w X X X WX1.W W W W 


1 8700 


GCGPPGPPAG 

wWwwWwWWAw' 


Wwwl7wX/lw X V? 


XO / D\J 


PPTAPPPPPG 
wUl Awwwwww 


wwXaX Lwluu 


1 PQftfl 

xoouu 


ma nnnrt A pcp 


PflA'APPAPPA 


XooDU 


nuU*ULu X Ijw X 


rir* c*r*r*r*f* tl hit 
buLLLLuAl X 






X w*w X V» w UlAC 


1QQCA 


bULublLl X X 




19000 


X w ww GG X uLt 


GGGATTCCGA 


19050 


GGwwlCGGCC 


TGACGGGCGG 


19100 


\JGCGXCGCAC 


CGTCGCATGC 


19150 


X uAX UuLUuL 


bbtbAl IbbU 


192O0 


Lnuu wl? LnuA 


VxrV wii w X Vjrii X X 




T A A A A A CTPT 1 
X niuuiau lux 


o VjAw X wX wi w 




ATGGAAGAPA 


*PPA A PTT'l'nP 
X wxuiw XXX or w 




fJTTPATfZfZf^A 
wX lV*AXuu\7n 


A APT^IfZPA Afl 
Arlw X uuwinb 




PP f PTPACP f PfI 

WW x x Wiww X W 


uubw X wVa w X V» 




GTTAAGAACT 


ATGGPAGPA A 
a x w w wr\w win 




GAGGGATAAG 


TTGAA AfZArtP 




TGGCCTCTGG 


CATTAGCGGG 


19600 


AATAAGATTA 


ACAGTAAGCT 


19650 


ACCGGCCGTG 


GAGACAGTGT 


19700 


GCCCCGACAG 


GGAAGAAACT 


19750 


TACGAGGAGG 


CACTAAAGCA 


19800 
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AGGCCTGCCC ACCACCCGTC CCATCGCGCC 
GCCAGCACAC ACCCGTAACG CTGGACCTGC 
CAGAAACCTG TGCTGCCAGG CCCGACCGCC 
CCGCGCGTCC CTGCGCCGCG CCGCCAGCGG 
TAGCCAGTGG CAACTGGCAA AGCACACTGA 
GTGCAATCCC TGAAGCGCCG ACGATGCTTC 
TGTGTGTCAT GTATGCGTCC ATGTCGCCGC 
CGCGCGCCCG CTTTCCAAGA TGGCTACCCC 
CTTACATGCA CATCTCGGGC CAGGACGCCT 
CTGGTGCAGT TTGCCCGCGC CACCGAGACG 
GTTTAGAAAC CCCACGGTGG CGCCTACGCA 
CCCAGCGTTT GACGCTGCGG TTCATCCCTG 
TACTCGTACA AGGCGCGGTT CACCCTAGCT 
GGACATGGCT TCCACGTACT TTGACATCCG 
CTACTTTTAA GCCCTACTCT GGCACTGCCT 
GGTGCCCCAA ATCCTTGCGA ATGGGATGAA 
AAACCTAGAA GAAGAGGACG ATGACAACGA 
CTGAGCAGCA AAAAACTCAC GTATTTGGGC 
AATATTACAA AGGAGGGTAT TCAAATAGGT 
ATATGCCGAT AAAACATTTC AACCTGAACC 
GGTACGAAAC TGAAATTAAT CATGCAGCTG 
ACCCCAATGA AACCATGTTA CGGTTCATAT 
TGGAGGGCAA GGCATTCTTG TAAAGCAACA 
AAGTGGAAAT GCAATTTTTC TCAACTACTG 
GATAACTTGA CTCCTAAAGT GGTATTGTAC 
AACCCCAGAC ACTCATATTT CTTACATGCC 
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P 2k Tfi (2 PT 2k P P 


cnjvmfftPTGG 

wwXVU X WW X ww 


19850 




Pf22kP2kPPP2kC 

wwAwAwwwXlw 


1 QQOO 




pppp f ppp r T i 7An 

tttu X ww X Al? 


i QQ^n 


X w wG wGAX wG 


nrwpp ppp pppc 


9 nnnn 
z uuuu 


ACAGCATCGT 


GGG it X GGGG 


ZUUDU 


TGAATAGCTA 


AwGXGX wGXA 






P PT/2 & fSPPflP 
\3 w X uAu w w V? w 


a UIjU 


X X CGAX GA X G 


LLbtAu X bi? X 




i**r+r* TV /^TTV PPTi 


P 7A PPPPP/2/in 
w'Awwwwwww'w' 




rriTV /^rprpOTV 
lAUlXUAbUt 


TP & 2k T2k 2k PA A 


a UjvU 


CGACGTGACC 


TV Oft P TV OOf^n"! 

Aw AG Aw L.GG X 




X GGAC wG X GA 


bbAlAL X btb 


nn Ann 
£ U4UU 


GTGGGTGATA 


AC CG I G X G C X 




CGGCGTGCTG 


^ tv otv r*f*r*r*r*c* 
GAwAGGGGww 


OACA/V 

^UDUU 


TV f» TV TV prppprn 
ALAALbLLL X 


r 1 p r*f*PP P Ik 2k P 
bbL X UULAAb 




^^^n^^ ^^^mfc ^^^n^^ 

G CTGCTAC X G 


wX wX ibAAAl 


6UOUU 


TV TV /"»/■* TV IV PflP IV 

AGA wGAAG X A 


TV r*f* TV pOTV TV 
bAbbAb bAAb 




AGGCGGCX XA 


llbibbiAlA 


n n*7nn 


^m^^ iv tv ^^m^> 

GTCGAAGGTC 


TV TV TV PTA /^/"irTITV TV 

AAA CA w C X AA 


O n*7KA 
/ DU 


m^iv tv iv m tv tv 

TCAAATAGGA 


^ TV TV rn/^**T»OTA 
bAAivl LAb X 


OftQAA 

ZUoUU 


GGAGAG X ww X 


m TV TV TV TV TV P 7A PHP 
X AAAAAbAb X 




GCAAAACCCA 


CAAATGAAAA 


20900 


AAATGGAAAG 


CTAGAAAGTC 


20950 


AGGCGACC6C 


AGGCAATGGT 


21000 


AGTGAAGATG 


TAGATATAGA 


21050 


CACTATTAAG 


GAAGGTAACT 


21100 
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CACGAGAACT 


AATGGGCCAA 

***** wwuvwm 


P A ATPT ITfiP 


ppta ivpzvpppp 


fir 7A tv nnrp tv TV ^^rn 
1 AAX x AC ATT 


4 en 

211D0 


GCTTTTAGGG 


ACAATTTTAT 


tgp/tptzi amr 


T JV TT 2V P a IV P 2v 
X AX XAUAAUA 


PPTAOPPPIflTi TV 

GGACGGGTAA 


21200 


TATGGGTGTT 


CTGGCGGGPP 


A AGPATPP.PA 


P 1 1 U 1 5V IV fPP / tl M 


G 1 X G TAG ATT 


21250 


TGCAAGACAG 


AAACACAGAG 

****** V*X* ^^Xjh VJXa VJ 


ptttp at a pp 

will wilAvv 


a p /■'nirniiwi / »i p 
nuLll X luUl 


IVjAIILLAI X 


21300 


GGTGATAGAA 


CCAGGT A PTT 


TTPTATP.TPP 


TV TV*"PP7V PPPT»P 
AAX (JAGGC X G 


f 1 if lip 7\ p TV ppm » 

X X GACAGCTA 


21350 


TGATCCAGAT 


GTT AG A ATT7* 


rnm/^ » » » » rnpTv 


cppp ta iv pmp tv > 
TGGAACTGAA 


GATGAACTTC 


21400 


CAAATTACTG 


CTTTCCACTG 


IjG AGG X G 1 GA 


rnrpTA TV m TV P TV P T\ 

TTAATACAGA 


P TV Plf IPIllll'tlV 

GACTCTTACC 


21450 


AA6GTAAAAC 


CTAAAACAGG 


TCAGGAAAAT 


GGATGGGAAA 


AAGATGCTAC 


21500 


AGAATTTTCA 


GATAAAAATG 


AAATAAGAGT 


TGGAAATAAT 


TTTGCCATGG 


21550 


AAATCAATCT 


AAATGCCAAC 


CTGTGGAGAA 


m mmtw\*m **m*mrw* ^fc 

ATTTCCTGTA 


CTCCAACATA 


21600 


GCGCTGTATT TGCCCGACAA 


GCTAAAGTAC 


AGTCCTTCCA 


ACGTAAAAAT 


21650 


TTCTGATAAC 


CCAAACACCT 


n PP* pi^tv /ii m 

ACGACTACAT 


GAACAAGCGA 


GTGGTGGCTC 


21700 


CCGGGTTAGT 


GGACTGCTAC 


ATTAACCTTG 


*PV ^ *WA m **« ^mmm* *m 

GAGCACGCTG 


GTCCCTTGAC 


21750 


TATATGGACA 


ACGTCAACCC 


TV f 1 U 1 II 1 1 TV TV /*/*T» 

ATTTAACCAC 


CACCGCAATG 


CTGGCCTGCG 


21800 


CTACCGCTCA 


ATGTTGCTGG 


pp?v tv mr*^*m^>/^ 
GCAATGGTCG 


/^wi* m/^m/« ****** 

CTATGTGCCC 


TTCCACATCC 


21850 


AGGTGCCTCA 


GAAGTTCTTT 


P.PP2V TT1V TV TV 7V 

oLLAl 1 AAAA 


ACCTCCTTCT 


CCTGCCGGGC 


21900 


TCATACACCT 


ACGAGTGGAA 


PTTPaC^a TAP 


P JA rn/Trii"p TV TV P TV 
bA ibl X AACA 


TGGTTCTGCA 


21950 


GAGCTCCCTA 


GGAAATGACC 


T A A GGGTTP. A 


ppp tv f^r^Ticr* 


TV mm tv TV prnmmp 

AX 1AAGTTTG 


*\ *\ r\ r\ r\ 

22000 


ATAGCATTTG 


CCTTTACGCC 


AV^Ui X LI X tv 


LUA1 GGCCCA 


PT\ TV P TV />^^l/ a i^« 

CAACACCGCC 


^ ^ r\ e f\ 

22050 


TCCACGCTTG 


AGGCCATGCT 


TAfIA A APP.7AP 


71 PP A 7A PP TV PP 
ALLAALbALv' 


AGTCCTTTAA 


22100 


CGACTATCTC 


TCCGCCGCCA 


A P A TP PT PT 2V 


PPpmTv cptv ppp 


pppti tv r*f+ tv 

GCCAACGCTA 


22150 


CCAACGTGCC 


CATATCCATC 


CCCTCCCGCA 


ACTGGGCGGC 


TTTGCGCGGC 


22200 


TGGGCCTTCA 


CGCGCCTTAA 


GACTAAGGAA 


ACCCCATCAC 


TGGGCTCGGG 


22250 


CTACGACCCT 


TATTACACCT 


ACTCTGGCTC 


TATACCCTAC 


CTAGATGGAA 


22300 


CCTTTTACCT 


CAACCACACC 


TTTAAGAAGG 


TGGCCATTAC 


CTTTGACTCT 


22350 


TCTGTCAGCT 


GGCCTGGCAA 


TGACCGCCTG 


CTTACCCCCA 


ACGAGTTTGA 


22400 
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AATTAAGCGC 


TCAGTTGACG 


GGGAGGGTTA 


TGACCAAAGA 


CTGGTTCCTG 


GTACAAATGC 


TACCAGGGCT 


TCTATATCCC 


AGAGAGCTAC 


CTTTAGAAAC 


TTCCAGCCCA 


TGAGCCGTCA 


ACAAGGACTA 


CCAACAGGTG 


GGCATCCTAC 


TTTGTTGGCT 


ACCTTGCCCC 


CACCATGCGC 


TAACTTCCCC 


TATCCGCTTA 


TAGGCAAGAC 


AGAAAAAGTT 


TCTTTGCGAT 


CGCACCCTTT 


AACTTTATGT 


CCATGGGCGC 


ACTCACAGAC 


CGCCAACTCC 


GCCCACGCGC. 


TAGACATGAC 


ACGAGCCCAC 


CCTTCTTTAT 


GTTTTGTTTG 


GTGCACCGGC 


CGCACCGCGG 


CGTCATCGAA 


CTTCTCGGCC 


GGCAACGCCA 


CAACATAAAG 


AGCTGCCGCC 


ATGGGCTCCA 


GTGAGCAGGA 


ATCTTGGTTG 


TGGGCCATAT 


TTTTTGGGCA 


GGCTTTGTTT 


CTCCACACAA 


GCTCGCCTGC 


TCGCGAGACT 


GGGGGCGTAC 


ACTGGATGGC 


CAAAAACATG 


CTACCTCTTT 


GAGCCCTTTG 


AAGCAGGTTT 


ACCAGTTTGA 


GTACGAGTCA 


TGCTTCTTCC 


CCCGACCGCT 


GTATAACGCT 


TACAGGGGCC 


CAACTCGGCC 


GCCTGTGGAC 


CACGCCTTTG 


CCAACTGGCC 


CCAAACTCCC 


GAACCTTATT 


ACCGGGGTAC 


CCAACTCCAT 


AGCCCACCCT 


GCGTCGCAAC 


CAGGAACAGC 


CACTCGCCCT 


ACTTCCGCAG 


CCACAGTGCG 


TTTTTGTCAC 


TTGAAAAACA 


TGTAAAAATA 
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/■•TV 7V fprpo^r* 


PTi^m^rnTi tv f> 7v 

CAGTGTAACA 




T»7Af PT»7A n f rpTv 


^TV 7A OTA rwp/*^^» 




a a rr a nnrr* a 


lbXALlLUl 1 






CATAPT 1 A A AT 


£ & o w 


A PP A A P & p a A 


pa a r r rr r rfid A 

Wui\# 19UA 


& & O 




LUlALttKit 


/UU 


CGCAGTTGAC 


AGCATTACCC 


22750 


GGCGCATCCC 


ATTCTCCAGT 


22800 


CTGGGCCAAA 


ACCTTCTCTA 


22850 


TTTTGAGGTG 


GATCCCATGG 


22900 


- s . 

AAGTCTTTGA 


CGTGGTCCGT 


229DO 




X bLijUAUuUL 




AAGCAAGCAA 


/-i tv mnx tv o tv iv 

CAT C AAC AAC 


23Q50 


ACTGAAAGCC 


ATTGTCAAAG 


23100 


CCTATGACAA 


GCGCTTTCCA 


23150 


GCCATAGTCA 


ATACGGCCGG 


23200 


CTTTGCCTGG 


IV IV ^^^^^*^TV Still 

AACCCGCACT 


O O O C f\ 

23250 


GCTTTTCTGA 








O ffi 74 f* f*f* P P 7A T* 
blAbLbULAl 




/^j^ TV TV TV TV f*TT\r*f^ 

GGAAAAGTCC 


» or»/^7A 71 7APPP 




TATTCTGCTG 


CATGTTTCTC 




ATGGATCACA 


ACCCCACCAT 


23500 


GCTCAACAGT 


CCCCAGGTAC 


23550 


TCTACAGCTT 


CCTGGAGCGC 


23600 


CAGATTAGGA 


GCGCCACTTC 


23650 


ATGTACTAGA 


GACACTTTCA 


23700 
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ATAAAGGCAA 


ATGCTTTTAT 


TTGTACACTC 


TCGGGTGATT 


ATTTACCCCC 


23750 


ACCCTTGCCG 


TCTGCGCCGT 


TTAAAAATCA 


AAGGGGTTCT 


GCCGCGCATC 


23800 


GCTATGCGCC 


ACTGGCAGGG 


ACACGTTGCG 


ATACTGGTGT 


TTAGTGCTCC 


23850 


ACTTAAACTC 


AGGCACAACC 


ATCCGCGGCA 


GCTCGGTGAA 


GTTTTCACTC 


23900 


CACAGGCTGC 


GCACCATCAC 


CAACGCGTTT 


AGCAGGTCGG 


GCGCCGATAT 


23950 


CTTGAAGTCG 


CAGTTGGGGC 


CTCCGCCCTG 


CGCGCGCGAG 


TTGCGATACA 


24000 


CAGGGTTGCA 


GGACTGGAAC 


ACTATCAGCG 


CCGGGTGGTG 

********** m± *W** m± %* 


CACGCTGGCC 
********** **** **** 


24050 


AGCACGCTCT 

nwvnwww x wx 


TGTCGGAGAT 

X w X VV7VAUA X 


CAGATCCGPG 


TCCAGGTCCT 

X W **X»\3 W X W ** X 


CPGCGTTGCT 

WWW WW X X WW X 


24100 

m* 1 X W W 


CAGGGCGAAC 


GGAGTCAAOT 


TTGGTAGCTG 


CCTTCCCAAA 

wwx x www«*xr* 


AAGGGCGCGT 

«^f*WW/W W*7W*7 X 


24150 

4& ~ X. w V# 


GCCCAGGCTT 


TGAGTTGCAC 


TCGCACCGTA 


GTGGCATCAA 


AAGGTGACCG 


24200 


TGCCCGGTCT 


GGGCGTTAGG 


ATACAGCGCC 


TGCATAAAAG 

x ** %*a ^ A CMWlW 


CCTTGATCTG 

^b ^b A A ^b ^b 


24250 


CTTAAAAGCC 


ACCTGAGCCT 


TTGCGCCTTC 


AGAGAAGAAC 


ATGCCGCAAG 


24300 


ACTTGCCGGA 


AAACTGATTG 


GCCGGACAGG 


CCGCGTCGTG 

****\*T**** X ***» » ** 


CACGCAGCAC 


24350 


CTTGCGTCGG 


TGTTGGAGAT 

^b ^* ^b ^b ^m-m * ^b 


CTGCACCACA 

m± ** ****** ******** 


TTTCGGCCCC 

X X * ****** ******** 


ACCGGTTCTT 

mV A ^b ^b ^b ^b 


24400 


CACGATCTTG 


GCCTTGCTAG 


ACTGCTCCTT 


CAGCGCGCGC 


TGCCCGTTTT 

^b %^ ^b ^b ^b ^b 


24450 


CGCTCGTCAC 


ATCCATTTCA 

AAA ^V^^Av^b ^b ^b ^^f3b 


ATCACGTGCT 

** X ««nVW X **** X 


CCTTATTTAT 

****x x**x x x n x 


C ATAATG CTT 


24500 


CCGTGTAGAC 


ACTTAAGCTC 


GCCTTCGATC 


TCAGCGCAGC 


GGTGCAGCCA 


24550 


CAACGCGCAG 


CCCGTGGGCT 


' CGTGATGCTT 

** ** A *** * A ** ** m* m± 


GTAGGTCACC 

X **>* *» X >***** ** 


TCTGCAAACG 


24600 


ACTGCAGGTA 


CGCCTGCAGG 


AATCGCCCCA 


TCATCGTCAC 


AAAGGTCTTG 

* AA M **** ^b ^b ^b 


24650 


TTGCTGGTGA 


AGGTCAGCTG 


CAACCCGCGG 


TGCTCCTCGT 


TCAGCCAGGT 


24700 


CTTGCATACG 


GCCGCCAGAG 


CTTCCACTTG 


GTCAGGCAGT 


AGTTTGAAGT 


24750 


TCGCCTTTAG 


ATCGTTATCC 


ACGTGGTACT 


TGTCCATCAG 


CGCGCGCGCA 


24800 


GCCTCCATGC 


CCTTCTCCCA 


CGCAGACACG 


ATCGGCACAC 


TCAGCGGGTT 


24850 


CATCACCGTA 


ATTTCACTTT 


CCGCTTCGCT 


GGGCTCTTCC 


TCTTCCTCTT 


24900 


GCGTCCGCAT 


ACCACGCGCC 


ACTGGGTCGT 


CTTCATTCAG 


CCGCCGCACT 


24950 


GTGCGCTTAC 


CTCCTTTGCC 


ATGCTTGATT 


AGCACCGGTG 


GGTTGCTGAA 


25000 
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ACCCACCATT 


TGTAGCGCCA 


CATCTTCTCT 


TTACCTCTGG 


TGATGGCGGG 


CGCTCGGGCT 


TTCTTCTTGG 


GCGCAATGGC 


CAAATCCGCC 


GCTGGGTGTG 


CGCGGCACCA 


GCGCGTCTTG 


CGGACTCGAT 


ACGCCGCCTC 


ATCCGCTTTT 


GGCGGCGACG 


GGGACGGGGA 


CGACACGTCC 


CGCCGCACCG 


CGTCCGCGCT 


CGGGGGTGGT 


GACTGGCCAT 


TTCCTTCTCC 


TATAGGCAGA 


GAGAAGAAGG 


ACAGCCTAAC 


CGCCCCCTCT 


CACCGATGCC 


GCCAACGCGC 


CTACCACCTT 


TTGAGGAGGA 


GGAAGTGATT 


ATCGAGCAGG 


GACGACGAGG 


ACCGCTCAGT 


ACCAACAGAG 


CAACGCAGAG 


GCAAACGAGG 


AACAAGTCGG 


GCGACTACCT 


AGATGTGGGA 


GACGACGTGC 


CAGTGCGCCA 


TTATCTGCGA 


CGCGTTGCAA 


CGCCATAGCG 


GATGTCAGCC 


TTGCCTACGA 


GCGTACCCCC 


CAAACGCCAA 


GAAAACGGCA 


CTCAACTTCT 


ACCCCGTATT 


TGCCGTGCCA 


CATCTTTTTC 


CAAAACTGCA 


AGATACCCCT 


GCCGAGCGGA 


CAAGCAGCTG 


GCCTTGCGGC 


ATCGCCTCGC 


TCAACGAAGT 


GCCAAAAATC 


CGAGAAGCGC 


GCGGCAAACG 


CTCTGCAACA 


GTCACTCTGG 


AGTGTTGGTG 


GAACTCGAGG 


GTACTAAAAC 


GCAGCATCGA 


GGTCACCCAC 


CCTACCCCCC 


AAGGTCATGA 


GCACAGTCAT 


GTGCGCAGCC 


CCTGGAGAGG 


GATGCAAATT 




PCT/US96/I024S 



llvlX WW X W»u 




95050 


X V» Av» AAvsV? 


bwuw lXWilX 


95100 


GCUGAGG J. CG 


AXtroWUvUUO 




TGATGAGTCT 


TCCTCGTCCT 




TTGGGGGCGC 


CCGGGGAGGC 


ZDZDU 


TCCATGGTTG 


GGGGACG 1 CG 




X X LbLbUl uw 






TV TV TV TV 7V m/^TV fW 

AAAAGAX uax 


ubAu X Uiu X W 






l*L»Al*l#UlA*X w 


O C4CA 


CCCCGTCGAG 




£jOUU 


ACCCAGGTTT 


m^m% % ^*^^tv iv 

TGTAAGCGAA 


o e c c a 
25550 


GATAAAAAGC 


TV TV TV f>/"*TVf*f*TV 

AAGACCAGGA 


ZOOlIU 


GCGGGGGGAC 


if tv tv Tv^^o7vrrv» 
GAAAGGCATG 




TGTTGAAGCA 


TCTGCAGCGC 


ZD /UU 


GAGCGGAGCG 


AX G X btCLL X 


ZD / DU 


ACGCCACCTA 


TTCTCACCGC 


o conn 


CATGCGAGCC 


CAACCCGCGC 


zDoDU 


GAGGTGu 1 I G 


pnii pom TV mnx 


nconn 

ZD7UU 


ATCCTGCCGT 


GCCAACCGCA 


ZDyDU 


AGGGCG CTGT 


o iv m n pr>fpr tv m 


zouuu 




X X bbALbLbA 


Z Ov/DU 


GGAAAACAGC 


GAAAATGAAA 


26100 


GTGACAACGC 


GCGCCTAGCC 


26150 


TTTGCCTACC 


CGGCACTTAA 


26200 


GAGTGAGCTG 


ATCGTGCGCC 


26250 


TGCAAGAACA 


AACAGAGGAG 


26300 
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66CCTACCC6 


CAGTTGGCGA 


CGAGCAGCTA 


CGAGCCTGCC 


GACTTGGAGG 


AGCGACGCAA 


TCGTTACCGT 


GGAGCTTGAG 


TGCATGCAGC 


ATGCAGCGCA 


AGCTAGAGGA 


AACATTGCAC 


CGTACGCCAG 


GCCTGCAAGA 


TCTCCAACGT 


CCTACCTTGG 


AATTTTGCAC 


GAAAACCGCC 


TCCACGCTCA 


AGGGCGAGGC 


GCGCCGCGAC 


CTTATTTCTA 


TGCTACACCT 


GGCAGACGGC 


GCTTGGAGGA 


GTGCAACCTC 


AAGGAGCTGC 


TTGAAGGACC 


TATGGACGGC 


CTTCAACGAG 


GGCGGACATC 


ATTTTCCCCG 


AACGCCTGCT 


TGCCAGACTT 


CACCAGTCAA 


AGCATGTTGC 


CTAGAGCGCT 


CAGGAATCTT 


GCCCGCCACC 


CTTTGTGCCC 


ATTAAGTACC 


GCGAATGCCC 


GCTACCTTCT 


GCAGCTAGCC 


AACTACCTTG 


GAAGACGTGA 


GCGGTGACGG 


TCTACTGGAG 


ATGCACCCCG 


CACCGCTCCC 


TGGTTTGCAA 


GTCAAATTAT 


CGGTACCTTT 


GAGCTGCAGG 


TCCGCGGCTC 


CGGGGTTGAA 


ACTCACTCCG 


CCTTCGCAAA 


TTTGTACCTG 


AGGACTACCA 


ACGAAGACCA 


ATCCCGCCCG 


CCAAATGCGG 


ACCCAGGGCC 


ACATTCTTGG 


CCAATTGCAA 


AGAGTTTCTG 


CTACGAAAGG 


GACGGGGGGT 


GCGAGGAGCT 


CAACCCAATC 


CCCCCGCCGC 


CCGCGGGCCC 


TTGCTTCCCA 


GGATGGCACC 


CGCCGCCACC 


CACGGACGAG 


GAGGAATACT 
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GCGCGCTG6C 


TTCAAACGCG 


26350 


ACTAATGATG 


GCCGCAGTGC 


26400 


GGTTCTTTGC 

ww X X X X X 


TGACCCGGAG 


26450 


T AC AC CTTT C 


GACAGGGCTA 


26500 


GGAGCTCTGC 


AA CCTGGTCT 


26550 


TTGGGCAAAA 


CGTGCTTCAT 

W w X ww X X Wl X 


«b W W W 


TACGTCCGCG 


ACTGCGTTTA 


26650 


CATGGGCGTT 


TGGCAGCAGT 


26700 


AGAAACTGCT 


AAAGCAAAAC 


26750 

V / w" w 


CGCTCCGTGG 

Www- X X ww^ 


CCGCGCACCT 

W W w ww Wf%W W X 


26800 

«w W W V W 


TAAAACCCTG 


CAACAGGGTC 
unn w av? w w x w 




AGAACTTTAG 

nUAAUX X XAV7 


G A A CTTT ATC 

w/XTlw X X XAX w 




TGCTGTGCAC 

X WW X w X W WA W 


TTCCTAGCGA 

X X ww X AVJwA 




TP f*G CC(Z PTT 

X V* UUUUU w X X 


TGGGGCCAPT 


57000 


CCTACCACTC 


TGACATAATG 


27050 


TGTCACTGTC 

X W X X X \+ 


GCTGCAACCT 

W X WnnwW X 


27100 

«W # x> w v 


TTCGCAGCTG 


CTTAACGAAA 


27150 


GTCCCTCGCC 


TGACGAAAAG 

x wx«wwi*nnnw 


27200 


GGGCTGTGGA 


CGTCGGCTTA 


27250 


CGCCCACGAG 


ATTAGGTTCT 


27300 


AGCTTACCGC 


CTGCGTCATT 


27350 


GCCATCAACA 


AAGCCCGCCA 


27400 


TTACTTGGAC 


CCCCAGTCCG 


27450 


CGCAGCCCTA 


TCAGCAGCAG 


27500 


CAAAAAGAAG 


CTGCAGCTGC 


27550 


GGGACAGTCA 


GGCAGAGGAG 


27600 
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GTTTTGG A ccz 

w X X A A vwAV^VJ 


AGGAGGAGGA 

** W W £^ W VJAV3 WX^. 


GGACATGATG 

wwAwA X wA X w 


GAAGAPTGGG 

WAAwAw X WWW 


AGAGPPTA^A 
AwAw ww XAw A 




CGAGGAAGCT 


TCCGAGGTCG 


AAGAGGTGTP 

AAwAw w X w X w 


AG APG A A APA 

AwAwwAAAwA 


PPGTPAPPPT 

www X vflvww X 


£* f 9 WW 


PGGTPGPATT 

Www X VtfUWAl X 


PPPPTPGPPG 

wwwwX wwwww 


GPGPPPP AGA 

w ww w w w wAwA. 


AATPGGPAAP 
AA X vuu w aA w 


PC^l'TTPP A CP 

www X X w wiVJ w 


£t § 1 mJ U 


A X www X AvnA 


WW X W WW W X WW 


X wAww WW www 


CFC PTPP 

WWWWWsAw X WW 


www X X wwwww 


^ / OUU 


a err 1 A 


ah ATnsmA PA 

Aw A X ubuAvA 


rr* a ptpp a a p 

w wA w X bbAA w 


wibbbUtbb X 


A APTV^OTA'lifSP 
AAw X wwAAww 




A cznncznnr* r*n 


PTfAPPPPi 21 

w X X Aw wwwAA 


pxrpa APA A P 
wAw wAAwAAw 


"hf*r , c*c*r % '& a^p 

AwwwwwAAww 


II F&TIF*&. 
wXAwwwwX wA 




TfZfZPfSPfifirsp 


APA All A APfSP 

AwAAwAAwww 


P A T A HTTP PT 
LAlAbl X wwX 


*pr*r* , T w rriP a Afi 

ibbllbUuib 


Aw X V7 X bbb 




LilAWil w X ww 


X X WWW WWW WW 


owXXX wXXwX 


wXAwwAX wAw 


WW WW X WW WW X 


zouuu 


X www w ww iAA 


P A TPPTr*P 2k T 

WAX wwX wwA-L 


rnjv PT A rprTP 
XAw X Aw wis X w 


AX wX wX AwAw 


p/tp 2V r P A PTPP 
wwwAX Aw X bL 




a fhczczfciff a 

Aw Uuu Uuuwi 




pappa lk n*PP 
wAw wAA UA Vi w 


AbUbbluAUA 


w Aw AAw wAAA 


7Qi on 


CPPP A ^PP 
OwwwAwwwwA 


X Ab UMbAL X 


CTbACAAAb C 


UwaaGAaAXU 


wAwAwwwwww" 


oqi en 


f2PAf2PA/2PAP 
wi w A w wAw Uitf 


P A pr* IV f*f* 71 fT* 

w A w w A w w A w w 


ppitippptptp 

GCTGCGTCTG 


bCbULLAALb 


7i 7i/^r«^OfTl7im^i 
AAww ww X AX w 


o q o nn 


p a fffffh ta c* 

wAwwwwwwAU* 


CX XAGAAACA 


GGATTTTTCC 


CaCTCTGTAT 


GCTATATTTC 




A a pap Ar*r | A* p * 

AAwAwAwwAo 


bbbtLAAbAA 


fill AP74 a'phip a 

CAAGAGCTGA 


AAATAAAAAA 


UAGGX C X wX G 


2oJUU 


pcatppptpa 


w w ww wAw w X w 


PPTPT ATP A P 
wwXGXAX wAw 


A7A A7A PPP A AP 


A TP A P PTTPfS 
AX wAww X X ww 


oft** t;n 




HA APAPPPPP 
wAAwAwwwwO* 


AbbUl wXwXX 


w Aw X AAA X AU 


TP PP PP PT^l A 

X WWW WW w X wA 


•5 q Ann 


w X w X X AAwwA 


LiAul X X www 


wwwwX X X wX w 


AAAX X X AAww 


PPP A A A APT A 
bUbAAAnu X A 




PGTPATPTPP 
ww X Wil w X 


A CZHClflFH A P A 
Ab www w wAwA 


wwwwwwwwwA 


bUAUL lbl wV7 


TPACPnPPAT 
X wAo'wwwwAx 


ooenn 


TiTCAPPA AP 
X A X wA w LnAu 


wAAAX X wwwa 


ww ww w X AvJAl 


wlwwAwl XAw 


PAPPPAPA A A 
Unu w wAwAAA 




rpdClCZ A PTTP P 
X www Aw X X \3 w 


www X w wAw w X 


pr»pp a apapt 
w w w wAAwAw X 


2V orn/^7v » FFCC* 
AwX wAAwwww 


71 ATA A APTAP 
AAX AAAwXAw 


6oOuU 


AT^Arzpcpnn 


HAPPPPAPAT 
w A w w w w A w A X 


P AT ATPPPPP 
bnlnl wwwww 


PTP A A PHfl A A 
V7 X wAAw w w AA 


TPPfZPflPPPA 
X www ww ww wA 


a O W ^/ v 


CCGAAACC6A 


ATTCTCTTGG 


AACAGGCGGC 


TATTACCACC 


ACACCTCGTA 


28700 


ATAACCTTAA 


TCCCCGTAGT 


TGGCCCGCTG 


CCCTGGTGTA 


CCAGGAAAGT 


28750 


CCCGCTCCCA 


CCACTGTGGT 


ACTTCCCAGA 


GACGCCCAGG 


CCGAAGTTCA 


28800 


6AT6ACTAAC 


TCAGGGGCGC 


AGCTTGCGGG 


CGGCTTTCGT 


CACAGGGTGC 


28850 


G6TCGCCC6G 


GCAGGGTATA 


ACTCACCTGA 


CAATCAGAGG 


GCGAGGTATT 


28900 
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CAGCTCAACG 


ACGAGTCGGT 


GAGCTCCTCG 


GACATTTCAG 


ATCGGCGGCG 


CCGGCCGTCC 


CAATCCTAAC 


TCTGCAGACC 


TCGTCCTCTG 


GGAACTCTGC 


AATTTATTGA 


GGAGTTTGTG 


CTTCTCGGGA 


CCTCCCGGCC 


ACTATCCGGA 


ACGCG6TAAA 


GGACTCGGCG 


GACGGCTACG 


GCAGAGCAAC 


TGCGCCTGAA 


ACACCTGGTC 


CTTTGCCCGC 


GACTCCGGTG 


AGTTTTGCTA 


ATATCGAGGG 


CCCGGCGCAC 


GGCGTCCGGC 


GCCCGTAGCC 


TGATTCGGGA 


GTTTACCCAG 


GGACAGGGGA 


CCCTGTGTTC 


TCACTGTGAT 


GATTACATCA 


AGATCTTTGT 


TGCCATCTCT 


AGAAATTAAA 


ATATACTGGG 


GCTCCTATCG 


TCTTCACCCG 


CCCAAGCAAA 


CCAAGGCGAA 


ATCTCTCCCT 


CTGTGATTTA 


CAACAGTTTC 


ACGAGAGAAC 


CTCTCCGAGC 


TCAGCTACTC 


TCCTTACCTG 


CCGGGAACGT 


ACGAGTGCGT 


CTACCGCCTG 


ACCGTAAACC 


AGACTTTTTC 


TGTTTACCAG 


AACAGGAGGT 


GAGCTTAGAA 


AAAGGCGCAG 


CTACTGTGGG 


GTTTATGAAC 


CTATTCTAAT 


TCAGGTTTCT 


CTAGAATCGG 


TTGTGATTCT 


CTTTATTCTT 


ATACTAACGC 


GCCTGCTGTG 


TGCACATTTG 


CATTTATTGT 


TCGCCACCCA 


AGATGATTAG 


GTACATAATC 


GTCAGCCCAC 


GGTACCACCC 


AAAAGGTGGA 


ATGTTACATT 


CGCAGCTGAA 


GCTAATGAGT 
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CTTGGTCTCC 


GTCCGGACGG 


28950 


TTCATTCACG 


CCTCGTCAGG 


29000 


AGCCGCGCTC 


TGGAGGCATT 


29050 


CCATCGGTCT 


ACTTTAACCC 


29100 


TCAATTTATT 

Mb ^b ^b>**^b ^b 


CCTAACTTTG 

^^^^ ^b A A* ^b ^b 


29150 


AOTGAATGTT 

w x w x x 


AAGTGGAGAG 


29200 


Wiv* X w X ww ww 


GCCACAAGTG 


29250 


wX x luAAl XV7 


PCCGAGGATC 

wwwwAwAJAX w 


£7 JUU 


X InUUuwwwl 


GGGAGAGCTT 

w*w*w°AwAw*wX X 




WW w w w w w X ww 


TAGTTGAGCG 

XXlw X XwAwww 


29400 


TTGCAACTGT 

X XwwAftwXwX 


CCTAACCTTG 

wwXAAwwX X w 


29450 


GTGCTGAGTA 

w X w w X UftU X £\ 


TAATAAATAC 


29500 


WwAX wwX w*X A 


AAWw'wwAwww* 




CCTT A C CFG G 

wwX XX^w w X ww 


XAwX X X InAv 


29600 

M+ J W W W 


AACCCAGACG 


GAGTGAGTCT 


29650 


CATCAGAAAA 

n»x*x vnwxwm 


AACACCACCC 

**** **** V* V***^* X* 


29700 


CACCGGCCGC 


TGCACCACAC 


29750 


CGGACAGACC 

W w Wn WlWnV w 


TCAATAACTC 

x x nnw x 


29800 


AACCCTTAGG 

AfXwwwX X Aww 


GTATTAGGCC 

wxn* xx»w*i*ww 


29850 

*S \J *J \J 


AATTCAAGCA 

fUl X X WUIWW1 


ACTCTACGGG 

X%w X w X **V^VJSJSJ 


29900 


GGTTGGGGTT 


ATTCTCTGTC 

nx x wx w x w 


29950 


TTCTCTGCCT 


AAGGCTCGCC 


30000 


CAGCTTTTTA 


AACGCTGGGG 


30050 


CTAGGTTTAC 


TCACCCTTGC 


30100 


TTTTAAGGAG 


CCAGCCTGTA 


30150 


GCACCACTCT 


TATAAAATGC 


30200 
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ACCACAGAAC 


ATGAAAAGCT 


GCTTATTCGC 


CACAAAAACA 


AAATTGGCAA 


30250 


GTATGCTGTT 


TATGCTATTT 


GGCAGCCAGG 


TGACACTACA 


GAGTATAATG 


30300 


TTACAGTTTT 


CCAGGGTAAA 

^VX«A A^^^rf A>AAAA*A 


AGTCATAAAA 


CTTTTATGTA 


TACTTTTCCA 


30350 


TTTTATGAAA 


TGTGCGACAT 


TACCATGTAC 


ATGAGCAAAC 


AGTATAAGTT 


30400 


GTGGCCCCCA 


CAAAATTGTG 


TGGAAAACAC 


TGGCACTTTC 


TGCTGCACTG 


30450 


CTATGCTAAT 


TACAGTGCTC 


GCTTTGGTCT 


GTACCCTACT 


CTATATTAAA 


30500 


TACAAAAGCA 


GACGCAGCTT 


TATTGAGGAA 


AAGAAAATGC 


CTTAATTTAC 


30550 


TAAGTTACAA 


AGCTAATGTC 


ACCACTAACT GCTTTACTCG 


CTGCTTGCAA 


30600 


AACAAATTCA 


AAAAGTTAGC 


ATTATAATTA 


GAATAGGATT 


TAAACCCCCC 


30650 


GGTCATTTCC 

N^VJ A> %***A> JL JL V* 


TGCTCAATAC 


CATTCCCCTG 


AACAATTGAC 


TCTATGTGGG 


30700 


ATATGCTCCA 


GCGCTACAAC 


CTTGAAGTCA 


GGCTTCCTGG 


ATGTCAGCAT 


30750 


CTGACTTTGG 


CCAGCACCTG 


TCCCGCGGAT 


TTGTTCCAGT 


CCAACTACAG 


30800 


CGACCCACCC 


TAACAGAGAT 

A AAA* A A>* A A A 


GACCAACACA 


ACCAACGCGG 


CCGCCGCTAC 


30850 


CGGACTTACA 


TCTACCACAA 

A> A AA^* ^*AA%*AAAA 


ATACACCCCA 


AGTTTCTGCC 


TTTGTCAATA 


30900 


ACTGGGATAA 

AA** * ^* A* A 


CTTGGGCATG 


TGGTGGTTCT 


CCATAGCGCT 


TATGTTTGTA 


30950 


TGCCTTATTA 

A> A* A> 4 A A> AAA 


TTATGTGGCT 


CATCTGCTGC 


CTAAAGCGCA 


AACGCGCCCG 


31000 


ACCACCCATC 

AA^^^*AA^*^^^*AA A 


TATAGTCCCA 


TCATTGTGCT 


ACACCCAAAC 


AATGATGGAA 


31050 


TCCATAGATT 


GGACGGACTG 


AAACACATGT TCTTTTCTCT 


TACAGTATGA 


31100 


TTAAATGAGA 


CATGATTCCT 

♦ ^b * ^» ^b ^^^^ *^ 


CGAGTTTTTA 


TATTACTGAC 


CCTTGTTGCG 


31150 


CTTTTTTGTG 


CGTGCTCCAC 


ATTGGCTGCG 


GTTTCTCACA 


TCGAAGTAGA 


31200 


CTGCATTCCA 


GCCTTCACAG 


TCTATTTGCT 


TTACGGATTT 


GTCACCCTCA 


31250 


CGCTCATCTG 


CAGCCTCATC 


ACTGTGGTCA 


TCGCCTTTAT 


CCAGTGCATT 


31300 


GACTGGGTCT 


GTGTGCGCTT 


TGCATATCTC 


AGACACCATC 


CCCAGTACAG 


31350 


GGACAGGACT 


ATAGCTGAGC 


TTCTTAGAAT 


TCTTTAATTA 


TGAAATTTAC 


31400 


TGTGACTTTT 


CTGCTGATTA 


TTTGCACCCT 


ATCTGCGTTT 


TGTTCCCCGA 


31450 


CCTCCAAGCC 


TCAAAGACAT 


ATATCATGCA 


GATTCACTCG 


TATATGGAAT 


31500 
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ATTCCAAGTT 


GCTACAATGA 


AAAAAGCGAT 


CTTTCCGAAG 


CCTGGTTATA 


31550 


TGCAATCATC 


TCTGTTATGG 


TGTTCTGCAG 


TACCATCTTA 


GCCCTAGCTA 


31600 


TATATCCCTA 


CCTTGACATT 


GGCTGGAAAC 


GAATAGATGC 


CATGAACCAC 


31650 


CCAACTTTCC 


CCGCGCCCGC 


TATGCTTCCA 


CTGCAACAAG 


TTGTTGCCGG 


31700 


CGGCTTTGTC 


CCAGCCAATC 


AGCCTCGCCC 


CACTTCTCCC 


ACCCCCACTG 


31750 

X # *J W 


AAATCAGCTA 


CTTTAATCTA 

X A J» **** ^ X ** 


ACAGGAGGAG 


ATGACTGACA 
a x wAw x wnvn 


CCCTAGATCT 


J lO VI w 


A6AAATG6AC 

**^J ****** A V^Xrf** V* 


GG AATTATT A 

NJVJX*** X X ** X X ** 


CAGAGCAGPG 


CCTGPTAGAA 

ww X V? w Xauaa 






CAGCGGCCGA 


GCAACAGCGC 

*w ^^****^^X*^J 


ATGAATCAAG 


AGCTCPAAGA 
au w X vvnnvjn 


PATGGTTAAP 

WAX ww X XAAw 




TTGCACCAGT 


GCAAAAGGGG 


TATCTTTTGT 

«fc n* w * * * X \rf A 


CTGGTAAAGC 


AGGCCAAAGT 

X*w wUUl\J X 


w X7 J W 


CACCTACGAC 


AGTAATACCA 


CCGGACACCG 


CCTTAGCTAC 


AAGTTGCCAA 


32000 


CCAAGCGTCA 


GAAATTGGTG 


GTCATGGTGG 


GAGAAAAGCC 

Vrf**V7********Vrf V* 


CATTACCATA 

WX^X XAwVA*n 


32050 

*J £* \J *J \J 


ACTCAGCACT 


CGGTAGAAAC 


CGAAGGCTGC 


ATTCAOTCAC 


CTTGTCAAGG 

w X X w X W1A w w 


32100 


ACCTGAGGAT 


CTCTGCACCC 


TTATTAAGAC 


CCTGTG CGGT 

w WX X W^ ^U\J* 


CTCAAAGATC 

w X vAAAU A X W 


32150 

•J ** X j w 


TTATTCCCTT 


TAACTAATAA 


AAAAAAATAA 
* ** *********** ^ ***& 


TAAAGCATCA 

XAAAwWTXX WA 


CTT ACTTA A A 

w X X2*^w X X AAA 


•9 £ £ WW 


ATCAGTTAGC 


AAATTTCTGT 


CCAGTTTATT 

^b ^b A A A ^b 


CAGCAGCACC 


TCCTTGCCCT 

X WWX XV9WWWX 


32250 

*J >m* \J 


CCTCCCAGCT 


CTGGTATTGC 


AGCTTCCTCC 


TGGCTGCAAA 


CTTTCTCCAC 

\* XXX \* X wx*«^ 


32300 

•J A <J w w 


AATCTAAATG 


GAATGTCAGT 


TTCCTCCTGT 


TCCTGTCCAT 


CCGCACCCAC 


32350 


TATCTTCATG 


TTGTTGCAGA 


TGAAGCGCGC 


AAGACCGTCT 


GAAGATACCT 


32400 


TCAACCCCGT 


GTATCCATAT 


GACACGGAAA 


CCGGTCCTCC 


AACTGTGCCT 


32450 

V A *x w 


TTTCTTACTC 


CTCCCTTTGT 


ATCCCCCAAT 


GGGTTTCAAG 


AGAGTCCCCC 


32500 


TGGGGTACTC TCTTTGCGCC TATCCGAACC TCTAGTTACC TCCAATGGCA 


32550 


TGCTTGCGCT 


CAAAATGGGC 


AACGGCCTCT 


CTCTGGACGA 


GGCCGGCAAC 


32600 


CTTACCTCCC 


AAAATGTAAC 


CACTGTGAGC 


CCACCTCTCA 


AAAAAACCAA 


32650 


GTCAAACATA AACCTGGAAA 


TATCTGCACC 


CCTCACAGTT 


ACCTCAGAAG 


32700 


CCCTAACTGT 


GGCTGCCGCC 


GCACCTCTAA 


TGGTCGCGGG 


CAACACACTC 


32750 


ACCATGCAAT 


CACAGGCCCC 


GCTAACCGTG 


CACGACTCCA AACTTAGCAT 


32800 
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T6CCACCCAA 


GGACCCCTCA 


CA6T6TCA6A 


AGGAAAGCTA 


GCCCTGCAAA 

w w w x w^ vnnn 


32850 


CATCAGGCCC 


CCTCACCACC 


ACCGATAGCA 


GTACCCTTAC 


TATCACTGCC 

X'*X W%w X W^w w 


32900 


TCACCCCCTC 


TAACTACTGC 


CACTGGTAGC 


TTGGGCATTG 


ACTTGAAAGA 

•** W A, X UAAAV7A 


32950 

m* ^ W 


GCCCATTTAT 

\9 WWW«»X A AdX 


ACACAAAATG 


GAAAACTAGG 


ACTAAAGTAP 

Aw X AAAwX Aw 


GGGfiiPTPPTT 

WWWWwX wwX X 


^ w v w w 


TGCATGTAAC 


AGACGACCTA 


A A CACTTTG A 

£1X»W«* w x a x w^n 


CCGTAGPAAP 

www X AW WAAW 


TGGTPPAGGT 

X WWX w>WAWWX 




GTGACTATTA 


ATAATACTTC 


PTTGCAAACT 


AAAGTT A CTG 

AAAW X XAw X W 


GAGPPTTGGG 

WAW ww X X Www 


ww XVU 


TTTTGATTCA 

^b «fc ^b ^^•**b ^b * 


CAAGGCAATA 


TGCAACTTAA 


TGTAGCAGGA 


GGACTAAGGA 


33150 


TTGATTCTCA 

^b ^b vAA ^b ^b ^# ^b A 


AAACAGACGC 


PTTATACTTG 

V»X XAaRWA X\7 


ATGTT AGTT A 

AXWX 1AVJ1 1A 


TPPGTTTGAT 

X w wW X X X WAX 


33500 

J W M \S W 


GCTCAAAACC 


AACTAAATCT 


A AG ACTAGG A 


wAwwwwww X w 


rpTTTTATA A A 

X X X X X AX AAA 




CTCAGCCCAC 

WX wX^W^WWwXaw 


A A PTTGG AT A 


TT A A PT A P A A 

X lAAv X AUAA 


PA A AfinPPTT 

WAAAWWWWX X 


T APTTCTTTA 

X AwX X w"X X X A 


JJJvU 


CAGCTTCAAA 

WX&W W X X vnnA 


PAATTCCAAA 
waax x wwaaa 


A A fSPTTC A fZfZ 

AAwWX X WAWW 


mm a tv r»OT A A /2 

X XAAwwXAAW 


PA PT/2PP AAA 

WAW X WW WAAW 




GGGTTGATGT 

WWW X X vAl W X 


TTG A rCPT A P 

X X WAWW W X Aw 


AfSPPATACPP 


att a atppap 
aX IaAI WwAw 


PAPATCC/iPT 
wawaX WWWw X 




TGAATTTGGT 

x wnn4 x x v?^7 x 


TPACCTAATG 

X WlvUlAAl W 


PAPPAAAPAP 

WAWWAAAWaW 


A A ATPPPPTP 

AAAX ww ww X w 


A A A APA A A A A 

AAAAwAAAAA 




TTGGCCATGG 

X X wwWWl X ww 


WWXAwAAX X X 


fZATTPA A APA 

WAX X WAAAWA ' 


AWWwXAX wwX 


TPPTA A A PT A 
X WwXaaawXa 




GGAACTGGCC 


TT A GTTTTG A 

X XAWX X X 1UA 


PAGPAPAGGT 

WAW WAWAwW X 


nPP ATT AP AC 

WWwAX XAwAW 


TAAAAAAPAA 

X A WW AAA WAA 


w^ w*9w 


AAATAATGAT 


AAGCTAAOTT 

•AAWWXAJT&WX X 


TGTGGACPAP 

X WX W WAW WAW 


APPACPTPPA 

AwwAVJ w X WWA 


TPTPPT A A PT 
x wx wwxaawx 


33600 

•J w w w w 


GTAGACTAAA 


TGP AG AG A A A 

X w UAwAwnAn 


G ATGPT A A A P 

WA X ww X AAA W 


TP A PTTT/IfilT 
X wAwX x X wwX 


PTTA APA A A A 
w X X aawaaaa 




TGTGGCAGTC 

X W X WWW«lW X W 


AAATAPTTGP 

AAAXAWX X WW 


TAPAGTTTPA 
X A WAW xxx w\ 


nTTTTCrtPTIZ 
wX X X X wwwX w 


TT A A AGGP AG 

X X AAAWWWAw 


33700 


TTTGGPTPPA 

X X X WWW X WWA 


ATATPTGG A A 

AX AX WX WWAA 


C AGTTP A A A G 

vAUl X WAAAW 


TCPTPATPTT 
X WwX waX wX X 


ATTATAAGAT 

AX XAXAAwAX 


33750 

WW / wv 


TTGACGAAAA 
x xwawwaaaa 


TGGAGTGPTA 

X ww AW X WW X A 


PTAA APA ATT 

W X AAA WAA X X 


PPTTPPTGGA 

ww X X ww X WWA 


CCCAGAATAT 

vvVAUAAlAl 


33800 

W W w w W 


TGG AACTTTA 

x wwrir»v xxx *» 


GAAATGGAGA 

W AAA X ww Aw A 


TCTTAPTGAA 

X W X X Aw X WAA 


GGCACAGCCT 

WW wAwXIW w W X 


ATACAAACGC 


33850 


TGTTGGATTT 


ATGCCTAACC 


TATCAGCTTA 


TCCAAAATCT 


CACGGTAAAA 


33900 


CTGCCAAAAG 


TAACATTGTC 


AGTCAAGTTT 


ACTTAAACGG 


AGACAAAACT 


33950 


AAACCTGTAA 


CACTAACCAT 


TACACTAAAC 


GGTACACAGG 


AAACAGGAGA 


34000 


CACAACTCCA 


AGTGCATACT 


CTATGTCATT 


TTCATGGGAC 


TGGTCTGGCC 


34050 


ACAACTACAT 


TAATGAAATA 


TTTGCCACAT 


CCTCTTACAC 


TTTTTCATAC 


34100 
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ATTGCCCAAG 


AATAAAGAAT 


CGTTTGTGTT 


ATGTTTCAAC 


GTGTTTATTT 


34150 


TTCAATTGCA 


GAAAATTTCA 


AGTCATTTTT 


CATTCAGTAG 


TATAGCCCCA 


34200 


CCACCACATA 


GCTTATACAG 


ATCACCGTAC 


CTTAATCAAA 


CTCACAGAAC 


34250 


CCTAGTATTC 


AACCTGCCAC 


CTCCCTCCCA 


ACACACAGAG 


TACACAGTCC 


34300 


TTTCTCCCCG 


GCTGGCCTTA 


AAAAGCATCA 


TATCATGGGT 


AACAGACATA 


34350 


TTCTTAGGTG 


TTATATTCCA 


CACGGTTTCC 


TGTCGAGCCA 


AACGCTCATC 


34400 


AGTGATATTA 


ATAAACTGGC 


GGCGATATAA 


AATGCAAGGT 


GCTGCTCAAA 


34450 


AAATCAGGCA 


AAGCCTCGCG 


CAAAAAAGAA 


AGCACATCGT 


AGTCATGCTC 


34500 


ATGCAGATAA 


AGGCAGGTAA 


GCTCCGGAAC 


CACCACAGAA 


AAAGACACCA 


34550 


TTTTTCTCTC 


AAACATGTCT 


GCGGGTTTCT 


GCATAAACAC 


AAAATAAAAT 


34600 


AACAAAAAAA 


CATTTAAACA 


TTAGAAGCCT 


GTCTTACAAC 


AGGAAAAACA 


34650 


ACCCTTATAA 


GCATAAGACG 


GACTACGGCC 


ATGCCGGCGT 


GACCGTAAAA 


34700 


AAACTGGTCA 


CCGTGATTAA 


AAAGCACCAC 


CGACAGCTCC 


TCGGTCATGT 


34750 


CCGGAGTCAT 


AATGTAAGAC 


TCGGTAAACA 


CATCAGGTTG 


ATTCATCGGT 


34800 


CAGTGCTAAA 


AAGCGACCGA 


AATAGCCCGG 


GGGAATACAT 


ACCCGCAGGC 


34850 


GTAGAGACAA 


CATTACAGCC 


CCCATAGGAG 


GTATAACAAA 


ATTAATAGGA 


34900 


GAGAAAAACA 


CATAAACACC 


TGAAAAACCC 


TCCTGCCTAG 


GCAAAATAGC 


34950 


ACCCTCCCGC 


TCCAGAACAA 


CATACAGCGC 


TTCACAGCGG 


CAGCCTAACA 


35000 


GTCAGCCTTA 


CCAGTAAAAA 


AGAAAACCTA 


TTAAAAAAAC 


ACCACTCGAC 


35050 


ACGGCACCAG 


CTCAATCAGT 


CACAGTGTAA 


AAAAGGGCCA 


AGTGCAGAGC 


35100 


GAGTATATAT 


AGGACTAAAA 


AATGACGTAA 


CGGTTAAAGT 


CCACAAAAAA 


35150 


CACCCAGAAA 


ACCGCACGCG 


AACCTACGCC 


CAGAAACGAA 


AGCCAAAAAA 


35200 


CCCACAACTT 


CCTCAAATCG 


TCACTTCCGT 


TTTCCCACGT 


TACGTAACTT 


35250 


CCCATTTTAA 


GAAAACTACA 


ATTCCCAACA 


CATACAAGTT 


ACTCCGCCCT 


35300 


AAAACCTACG 


TCACCCGCCC 


CGTTCCCACG 


CCCCGCGCCA 


CGTCACAAAC 


35350 
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TCCACCCCCT CATTATCATA TTGGCTTCAA TCCAAAATAA GGTATATTAT 35400 
TGATGATG 35408 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GCCCAATACG 


CAAACCGCCT 


CTCCCCGCGC 


GTTGGCCGAT 


TCATTAATGC 


50 


AGCTGCGCGC 


TCGCTCGCTC 


"ACTGAGGCCG 


CCCGGGCAAA 


GCCCGGGCGT 


100 


CGGGCGACCT 


TTGGTCGCCC 


GGCCTCAGTG 


AGCGAGCGAG 


CGCGCAGAGA 


150 


GGGAGTGGCC 


AACTCCATCA 


CTAGGGGTTC 


CTTGTAGTTA 


ATGATTAACC 


200 


CGCCATGCTA 


CTTATCTACG 


TAGCCATTCT 


CTAGCCCCTG 


CAGGTCGTTA 


250 


CATAACTTAC 


GGTAAATGGC 


CCGCCTGGCT 


GACCGCCCAA 


CGACCCCCGC 


300 


CCATTGACGT 


CAATAATGAC 


GTATGTTCCC 


ATAGTAACGC 


CAATAGGGAC 


350 


TTTCCATTGA 


CGTCAATGGG 


TGGAGTATTT 


ACGGTAAACT 


GCCCACTTGG 


400 


CAGTACATCA 


AGTGTATCAT 


ATGCCAAGTA 


CGCCCCCTAT 


TGACGTCAAT 


450 


GACGGTAAAT 


GGCCCGCCTG 


GCATTATGCC 


CAGTACATGA 


CCTTATGGGA 


500 


CTTTCCTACT 


TGGCAGTACA 


TCTACGTATT 


AGTCATCGCT 


ATTACCATGG 


550 


TGATGCGGTT 


TTGGCAGTAC 


ATCAATGGGC 


GTGGATAGCG 


GTTTGACTCA 


600 


CGGGGATTTC 


CAAGTCTCCA 


CCCCATTGAC 


GTCAATGGGA 


GTTTGTTTTG 


650 


GCACCAAAAT 


CAACGGGACT 


TTCCAAAATG 


TCGTAACAAC 


TCCGCCCCAT 


700 


TGACGCAAAT 


GGGCGGTAGG 


CGTGTACGGT 


GGGAGGTCTA 


TATAAGCAGA 


750 


GCTCGTTTAG 


TGAACCGTCA 


GATCGCCTGG 


AGACGCCATC 


CACGCTGTTT 


800 


TGACCTCCAT 


AGAAGACACC 


GGGACCGATC 


CAGCCTCCGG 


ACTCTAGAGG 


850 


ATCCGGTACT 


CGAGGAACTG 


AAAAACCAGA 


AAGTTAACTG 


GTAAGTTTAG 


900 
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TCTTTTTGTC 


TTTTATTTCA 


GGTCCCGGAT 


CCGGTGGTGG 


TGCAAATCAA 


950 


AGAACT6CTC 


CTCAGTGGAT 


GTTGCCTTTA 


CTTCTAGGCC 


TGTACGGAAG 


1000 


TGTTACTTCT 


GCTCTAAAAG 


CTGCGGAATT 


GTACCCGCGG 


CCGCAATTCC 


1050 


CGGGGATCGA 


AAGAGCCTGC 


TAAAGCAAAA 


AAGAAGTCAC 


CATGTCGTTT 


1100 


ACTTTGACCA 


ACAAGAACGT 


GATTTTCGTT 


GCCGGTCTGG 


GAGGCATTGG 


1150 


TCTGGACACC 


AGCAAGGAGC 


TGCTCAAGCG 


CGATCCCGTC 


GTTTTACAAC 


1200 


GTCGTGACTG 


GGAAAACCCT 


GGCGTTACCC 


AACTTAATCG 


CCTTGCAGCA 


1250 


CATCCCCCTT 


TCGCCAGCTG 


GCGTAATAGC 


GAAGAGGCCC 


GCACCGATCG 


1300 


CCCTTCCCAA 


CAGTTGCGCA 


GCCTGAATGG 


CGAATGGCGC 


TTTGCCTGGT 


1350 


TTCCGGCACC 


AGAAGCGGTG 


CCGGAAAGCT 


GGCTGGAGTG 


CGATCTTCCT 


1400 


GAGGCCGATA 


CTGTCGTCGT 


CCCCTCAAAC 


TGGCAGATGC 


ACGGTTACGA 


1450 


TGCGCCCATC 


TACACCAACG 


TAACCTATCC 


CATTACGGTC 


AATCCGCCGT 


1500 


TTGTTCCCAC 


GGAGAATCCG 


ACGGGTTGTT 


ACTCGCTCAC 


ATTTAATGTT 


1550 


GATGAAAGCT 


GGCTACAGGA 


AGGCCAGACG 


CGAATTATTT 


TTGATGGCGT 


1600 


TAACTCGGCG 


TTTCATCTGT 


GGTGCAACGG 


GCGCTGGGTC 


GGTTACGGCC 


1650 


AGGACAGTCG 


TTTGCCGTCT 


GAATTTGACC 


TGAGCGCATT 


TTTACGCGCC 


1700 


GGAGAAAACC 


GCCTCGCGGT 


GATGGTGCTG 


CGTTGGAGTG 


ACGGCAGTTA 


1750 


TCTGGAAGAT 


CAGGATATGT 


GGCGGATGAG 


CGGCATTTTC 


CGTGACGTCT 


1800 


CGTTGCTGCA 


TAAACCGACT 


ACACAAATCA 


GCGATTTCCA 


TGTTGCCACT 


1850 


CGCTTTAATG 


ATGATTTCAG 


CCGCGCTGTA 


CTGGAGGCTG 


AAGTTCAGAT 


1900 


GTGCGGCGAG 


TTGCGTGACT 


ACCTACGGGT 


AACAGTTTCT 


TTATGGCAGG 


1950 


GTGAAACGCA 


GGTCGCCAGC 




Lll J. vobUuu 




2000 


GATGAGCGTG 


GTGGTTATGC 


CGATCGCGTC 


ACACTACGTC 


TGAACGTCGA 


2050 


AAACCCGAAA 


CTGTGGAGCG 


CCGAAATCCC 


GAATCTCTAT 


CGTGCGGTGG 


2100 


TTGAACTGCA 


CACCGCCGAC 


GGCACGCTGA 


TTGAAGCAGA 


AGCCTGCGAT 


2150 


GTCGGTTTCC GCGAGGTGCG 


GATTGAAAAT 


GGTCTGCTGC 


TGCTGAACGG 


2200 
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CAAGCCGTTG 


CTGATTCGAG 


GCGTTAACCG 


TCACGAGCAT 


CATCCTCTGC 


2250 


ATGGTCAGGT 


CATGGATGAG 


CAGACGATGG 


TGCAGGATAT 


CCTGCTGATG 


2300 


AAGCAGAACA 


ACTTTAACGC 


CGTGCGCTGT 


TCGCATTATC 


CGAACCATCC 


2350 


GCTGTGGTAC 


ACGCTGTGCG 


ACCGCTACGG 


CCTGTATGTG 


GTGGATGAAG 


2400 


CCAATATTGA 


AACCCACGGC 


ATGGTGCCAA 


TGAATCGTCT 


GACCGATGAT 


2450 


CCGCGCTGGC 


TACCGGCGAT 


GAGCGAACGC 


GTAACGCGAA 


TGGTGCAGCG 


2500 


CGATCGTAAT 


CACCCGAGTG 


TGATCATCTG 


GTCGCTGGGG 


AATGAATCAG 


2550 


GCCACGGCGC 


TAATCACGAC 


GCGCTGTATC 


GCTGGATCAA 


ATCTGTCGAT 


2600 


CCTTCCCGCC 


CGGTGCAGTA 


TGAAGGCGGC 


GGAGCCGACA 


CCACGGCCAC 


2650 


CGATATTATT 


TGCCCGATGT 


ACGCGCGCGT 


GGATGAAGAC 


CAGCCCTTCC 


2700 


CGGCTGTGCC 


GAAATGGTCC 


ATCAAAAAAT 


GGCTTTCGCT 


ACCTGGAGAG 


2750 


ACGCGCCCGC 


TGATCCTTTG 


CGAATACGCC 


CACGCGATGG 


GTAACAGTCT 


2800 


TGGCGGTTTC 


GCTAAATACT 


GGCAGGCGTT 


TCGTCAGTAT 


CCCCGTTTAC 


2850 


AGGGCGGCTT 


CGTCTGGGAC 


TGGGTGGATC 


AGTCGCTGAT 


TAAATATGAT 


2900 


GAAAACGGCA 


ACCCGTGGTC 


GGCTTACGGC 


GGTGATTTTG 


GCGATACGCC 


2950 


GAACGATCGC 


CAGTTCTGTA 


TGAACGGTCT 


GGTCTTTGCC 


GACCGCACGC 


3000 


CGCATCCAGC 


GCTGACGGAA 


GCAAAACACC 


AGCAGCAGTT 


TTTCCAGTTC 


3050 


CGTTTATCCG 


GGCAAACCAT 


CGAAGTGACC 


AGCGAATACC 


TGTTCCGTCA 


3100 


TAGCGATAAC 


GAGCTCCTGC 


ACTGGATGGT 


GGCGCTGGAT 


GGTAAGCCGC 


3150 


TGGCAAGCGG 


TGAAGTGCCT 


CTGGATGTCG 


CTCCACAAGG 


TAAACAGTTG 


3200 


ATTGAACTGC 


CTGAACTACC 


GCAGCCGGAG 


AGCGCCGGGC 


AACTCTGGCT 


3250 


CACAGTACGC 


GTAGTGCAAC 


CGAACGCGAC 


CGCATGGTCA 


GAAGCCGGGC 


3300 


ACATCAGCGC 


CTGGCAGCAG 


TGGCGTCTGG 


CGGAAAACCT 


CAGTGTGACG 


3350 


CTCCCCGCCG 


CGTCCCACGC 


CATCCCGCAT 


CTGACCACCA 


GCGAAATGGA 


3400 


TTTTTGCATC 


GAGCTGGGTA 


ATAAGCGTTG 


GCAATTTAAC 


CGCCAGTCAG 


3450 


GCTTTCTTTC 


ACAGATGTGG 


ATTGGCGATA 


AAAAACAACT 


GCTGACGCCG 


3500 
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CTGCGCGATC 


AGTTCACCCG 


TGCACCGCTG 


GATAACGACA 


TTGGCGTAAG 


3550 


TGAAGCGACC 


CGCATTGACC 


CTAACGCCTG 


GGTCGAACGC 


TGGAAGGCGG 


3600 


CGGGCCATTA 


CCAGGCCGAA 


GCAGCGTTGT 


TGCAGTGCAC 


GGCAGATACA 


3650 


CTTGCTGATG 


CGGTGCTGAT 


TACGACCGCT 


CACGCGTGGC 


AGCATCAGGG 

mrn^m ^b*a m m» ^B*A 


3700 


GAAAACCTTA 


TTTATCAGCC 


GGAAAACCTA 


CCGGATTGAT 


GGTAGTGGTC 


3750 


AAATGGCGAT 


TACCGTTGAT 


GTTGAAGTGG 


CGAGCGATAC 


ACCGGATCCG 

m m-m& ^p*A m mV ^b» 


3800 


GCGCGGATTG 


GCCTGAACTG 


CCAGCTGGCG 


CAGGTAGCAG 


AGCGGGTAAA 

A AW >^ Nrf A X^*A<* 


3850 


CTGGCTCGGA 


TTAGGGCCGC 


AAGAAAACTA 


TCCCGACCGC 


CTTACTGCCG 


3900 


CCTGTTTTGA 


CCGCTGGGAT 


CTGCCATTGT 


CAGACATGTA 


TACCCCGTAC 


3950 


GTCTTCCCGA 


GCGAAAACGG 


TCTGCGCTGC 


GGGACGCGCG 


AATTGAATTA 


4000 


TGGCCCACAC 


CAGTGGCGCG 


GCGACTTCCA 


GTTCAACATC 


AGCCGCTACA 


4050 


GTCAACAGCA 


ACTGATGGAA 


ACCAGCCATC 


w uwn J. w X ww X 


GCACGCGGAA 


4100 


GAAGGCACAT 


GGCTGAATAT 


CGACGGTTTC 


CATATGGGGA 


TTGGTGGCGA 


4150 


CGACTCCTGG 


AGCCCGTCAG 


TATCGGCGGA 


ATTACAGCTG 


AGCGCCGGTC 


4200 

*v a w v 


GCTACCATTA 


CCAGTTGGTC 


TGGTGTCAAA 


AATAATAATA 


ACCGGGCAGG 


4250 


CCATGTCTGC 


CCGTATTTCG 


CGTAAGGAAA 


TCCATTATGT 

a a a*»av*a 


ACTATTTAAA 


4300 


AAACACAAAC 


TTTTGGATGT 


TCGGTTTATT 


CTTTTTCTTT 

A A * A A A A A 


TACTTTTTTA 

A m+ m+ m* m* m* A** 


4350 


TCATGGGAGC 


CTACTTCCCG 


TTTTTCCCGA 


TTTGGCTACA 


TGACATCAAC 


4400 


CATATCAGCA 


AAAGTGATAC 


GGGTATTATT 


TTTGCCGCTA 

mm %b* %b* «tV m A 


TTTCTCTGTT 

AAA A A W A A 


4450 


CTCGCTATTA 


TTCCAACCGC 


TGTTTGGTCT 


GCTTTCTGAC 


AAACTCGGCC 


4500 


TCGACTCTAG 


GCGGCCGCGG 


GGATCCAGAC 


ATGATAAGAT 


ACATTGATGA 


4550 


GTTTGGACAA 


ACCACAACTA 


GAATGCAGTG 


AAAAAAATGC 


TTTATTTGTG 


4600 


AAATTTGTGA 


TGCTATTGCT 


TTATTTGTAA 


CCATTATAAG 


CTGCAATAAA 


4650 


CAAGTTAACA 


ACAACAATTG 


CATTCATTTT 


ATGTTTCAGG 


TTCAGGGGGA 


4700 


GGTGTGGGAG 


GTTTTTTCGG 


ATCCTCTAGA 


GTCGACCTGC 


AGGGGCTAGA 


4750 


ATGGCTACGT 


AGATAAGTAG 


CATGGCGGGT 


TAATCATTAA 


CTACAAGGAA 


4800 
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LLLL 1 ASa X bA 


IbbAul XvitjU 




\* X bubLbu X C 


I3WX UbL X WiC 




X GAu{?Uub(?o 


UVjAL-WIAAVjLj 


X Lb^wUbAUb 


LUtbuuL XXX 


bUUUbbbubb 




V»L X <JAVj 1 IvAv 


LuAbLbAbLb 


LbUiibUlbbL 


blAAlAbUbA 


AOnV7V7 W V* \3 W 




ACCGATCGCC 


CTTCCCAACA 


GTTGCGGAGC 


CTGAATGG CG 


AAXGGAAX XC 


cnnn 


CAGACGATTG 


AGCGTCAAAA 


m^mn ^^mik mm 

TGTAGGTATT 


TCCATGAGCG 


TTTTTCCTGT 


C C f\ 

DODO 


TGCAATGGCT 


GGCGGTAATA 


TTGTTCTGGA 


mil mm ?v /^^tiv ^ 

TATTACCAGC 


AAGGCCGATA 


DXUO 


GTTTGAGTTC 


TTCTACTCAG 


GCAAGTGATG 


mm^ mm n % 

TTATTACTAA 


m^>iv iv 7i ^ TV TV ^rp 
TCAAAGAAGT 


5X50 


ATTGCGACAA 


CGGTTAATTT 


GCGTGATGGA 


CAGACTCTTT 


TAwTCGGXGG 




CCTCACTGAT 


TATAAAAACA 


CTTCTCAGGA 


TTCTGGCGTA 


CCGX XCUXGX 


OCA 


CTAAAATCCC 


TTTAATCGGC 


CTCCTGTTTA 


GCTCCCGCTC 


m/^ tv mm^m TV TV 0 

TGATTCTAAC 


O A A 

5300 


GAGGAAAGCA 


CGTTATACGT 


GCTCGTCAAA 


GCAACCATAG 


TACGCGCCCT 


5350 


GTAGCGGCGC 


ATTAAGCGCG 


GCGGGTGTGG 


TGGTTACGCG 


CAGCGTGACC 


5400 


GCTACACTTG 


CCAGCGCCCT 


AGCGCCCGCT 


CCTTTCGCTT 


TCTTCCCTTC 


5450 


CTTTCTCGCC 


% /%Mmm/«M ^^^^^^ 

ACGTTCGCCG 


GCTTTCCCCG 


TCAAGCTCTA 


AATCGGGGGC 


5500 


TCCCTTTAGG 


GTTCCGATTT 


AGTGCTTTAC 


GGCACCTCGA 


f>r+ryr*'% TV IV TV TV TV 

CCCCAAAAAA 


5550 


CTTGATTAGG 


GTGATGGTTC 


ACGTAGTGGG 


ifiMM 1 1 

CCATCGCCCT 


GATAGACGG X 


CCA A 


TTTTCGCCCT 


TTGACGTTGG 


AGTCCACGTT 


CTTTAATAGT 


GGACTCTTGT 


5o50 


TCCAAACTGG 


AACAACACTC 


AACCCTATCT 


CGGTCTATTC 


X X 1 XGA1 X X A 


K*7 A A 
O / UO 


TAAGGGATTT 


TGCCGATTTC 


GGCCTATTGG 


TTAAAAAATG 


AGCTGATTTA 


5 / DO 


x » » » % mil tin 

ACAAAAATTT 


AACGCGAATT 


mm^ % ^* * * * m 

TTAACAAAAT 


ATTAACGTTT 


AGAAXXXAAA 


cq A A 
5oOO 


XATTTGCTTA 


TACAATCTTC 


CTGTTTTTGG 


GGCilTrClti 


jvmmTVfp^TV iv 
AX X AX WiAt^ 


cocn 
jOjv 


GGGGTACATA 


TGATTGACAT 


GCTAGTTTTA 


CGATTACCGT 


TCATCGATTC 


5900 


TCTTGTTTGC 


TCCAGACTCT 


CAGGCAATGA 


CCTGATAGCC 


TTTGTAGAGA 


5950 


CCTCTCAAAA 


ATAGCTACCC 


TCTCCGGCAT 


GAATTTATCA 


GCTAGAACGG 


6000 


TTGAATATCA 


TATTGATGGT 


GATTTGACTG 


TCTCCGGCCT 


TTCTCACCCG 


6050 


TTTGAATCTT 


TACCTACACA 


TTACTCAGGC 


ATTGCATTTA 


AAATATATGA 


6100 
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GGGTTCTAAA 


AATTTTTATC 


CTTGCGTTGA 


AATAAAGGCT 


TCTCCCGCAA 


6150 


AAGTATTACA 


GGGTCATAAT 


GTTTTTGGTA 


CAACCGATTT 


AGCTTTATGC 


6200 


TCT6AGGCTT 


TATTGCTTAA 


TTTTGCTAAT 


TCTTTGCCTT 


GCCTGTATGA 


6250 


TTTATTGGAT 


GTTGGAATTC 


CTGATGCGGT 


ATTTTCTCCT 


TACGCATCTG 


6300 


TGCGGTATTT 


CACACCGCAT 


ATGGTGCACT 


CTCAGTACAA 


TCTGCTCTGA 


6350 


TGCCGCATAG 


TTAAGCCAGC 


CCCGACACCC 


GCCAACACCC 


GCTGACGCGC 


6400 


CCTGACGGGC 


TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


6450 


GTCTCCGGGA 


GCTGCATGTG 


TCAGAGGTTT 


TCACCGTCAT 


CACCGAAACG 


6500 


CGCGAGACGA 


AAGGGCCTCG 


TGATACGCCT 


ATTTTTATAG 

**A AAA A ** A 


GTTAATGTCA 


6550 


TGATAATAAT 


GGTTTCTTAG 


ACGTCAGGTG 


GCACTTTTCG 

WWrtX* A AAA 


GGGAAATGTG 


6600 


CGCGGAACCC 


CTATTTGTTT 


ATTTTTCTAA 


ATACATTCAA 


ATATGTATCC 


6650 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


TCAATAATAT 

A WmI A **** A ** A 


TGAAAAAGGA 


6700 


AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 

^k ^k A A ^k ^k 


CTTTTTTGCG 

^k ^k ^k ^k ^k tffe ^* 


6750 


GCATTTTGCC 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


6800 


AGATGCTGAA 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


6850 


TCAACAGCGG 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


6900 


ATGATGAGCA 


CTTTTAAAGT 


TCTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


6950 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


TCTCAGAATG 


7000 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


7050 


ACAGTAAGAG 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


7100 


GGCCAACTTA 


CTTCTGACAA 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


7150 


II II Ml 1 III 1/*^ * ^» » 

TTTTGCACAA 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


7200 


GAGCTGAATG 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


7250 


AGCAATGGCA 


ACAACGTTGC 


GCAAACTATT 


AACTGGCGAA 


CTACTTACTC 


7300 


TAGCTTCCCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


7350 


GGACCACTTC 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


7400 
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ATCTGGAGCC GGTGAGCGTG GGTCTCGCGG 
CAGATGGTAA GCCCTCCCGT ATCGTAGTTA 
GCAACTATGG ATGAACGAAA TAGACAGATC 
GATTAAGCAT TGGTAACTGT CAGACCAAGT 
TTGATTTAAA ACTTCATTTT TAATTTAAAA 
TTTGATAATC TCATGACCAA AATCCCTTAA 
AGCGTCAGAC CCCGTAGAAA AGATCAAAGG 
TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
GTGGTTTGTT TGCCGGATCA AGAGCTACCA 
TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
AGTTAGGCCA CCACTTCAAG AACTCTGTAG 
CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
TACGGGGTTG GACTCAAGAC GATAGTTACC 
GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
ACCGAACTGA GATACCTACA GCGTGAGCTA 
CGAAGGGAGA AAGGCGGACA GGTATCCGGT 
GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA 
CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 
GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
TCCCCTGATT CTGTGGATAA CCGTATTACC 
CGCTCGCCGC AGCCGAACGA CCGAGCGCAG 
CGGAAGAGC 
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TATGATTGCA 


GCACTGGGGC 


7450 


TCTACACGAC 


GGGGAGTCAG 


•7 er a a 
7500 


GCTGAGATAG 


GTGCCTCACT 


•7 CCA 

7550 


TTACTCATAT 


ATACTTTAGA 


7o00 


GGATCTAGGT 


GAAGATCCTT 


*7 ^ e ft 

7650 


CGTGAGTTTT 


CGTTCCACTG 


"7 "7 ft ft 

7700 


ATCTTCTTGA 


GATCCTTTTT 


7750 


AAAAACCACC 


GCTACCAGCG 


•7Q ftft 

7800 


ACTCTTTTTC 


CGAAGGTAAC 


•7 O C ft 

7850 


TGTCCTTCTA 


GTGTAGCCGT 


*7 ft ft ft 

7900 


CACCGCCTAC 


ATACCTCGCT 


*r a e ft 

7950 


A6TGGCGATA 


AGTCGTGTCT 


8000 


GGATAAGGCG 


^^^^ ^^^^ 

CAGCGGTCGG 


a ft c a 

8050 


*m ahum mm smt m ^* ^* ft 

GCTTGGAGCG 


m 4 mm\mm m mm. m^nmmi ^* 

AACGACCTAC 


8100 


TGAGAAAGCG 


CCACGCTTCC 


8150 


AAGCGGCAGG 


GTCGGAACAG 


8200 


ACGCCTGGTA 


TCTTTATAGT 


O A C ft 

8250 


CGTCGATTTT 


TGTGATGCTC 


8300 


CAGCAACGCG 


GCCTTTTTAC 


8350 


ACATGTTCTT 


TCCTGCGTTA 


8400 


GCCTTTGAGT 


GAGCTGATAC 


8450 


CGAGTCAGTG 


AGCGAGGAAG 


8500 
8509 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GCCCAATACG 




nrpnp cent* fin 

V* X ww www*ww*w 


ptitpp p/*v* iv m 

ox X uuLUvjAI 


x w\JL 




AGCT6CGC6C 




Aw. x uAVjuLLu 




vUvL w-tiw-uc* x 


1UU 


CG66C6ACCT 


TTGGTCGCCC 

* X w w X ww w W W 


wVJv#V» X wAV? X V7 


n tic* CI n nrri'h n 


UbUbuAbAUA 




GGGAGTGGCC 


AACTCCATCA 


V-» x nvjuuu X X w 


LllblAVjl XA 


AlbAl IaALL 


2UG 


CGCCATGCTA 


CTTATCTAPA 

w X X AX wiilvA 


X wAl VUAl UA 


A rprnpo TV omm 
Al X UuAuLl X 


bLAl but X U w 


OCA 


AGGTCGTTAC 


ATAACTTACC 


GTA A ATCfSPf* 

UiAAAlubLL 


en. nnrvfir* n*vr* 


AwwtjwwwAAw 


a 

JUU 


GACCCCCGCC 


CATTGACGTr? 

x x uau w x w 


A A T A A TVS A fY2 

AAl AA 1 VAvVJ 


lAlbi iwvvA 


lAblAACvvwv 


OCA 


AATAGGGACT 


TTCCATTGAC 


GTCAATGGGT 

x W1AA WW X 


GGAGTATTTA 

ww*AwX AX X XA 


Pn/TT 1 AAA PTfi 


Af\n 


CCCACTTGGC 


AGTACATCAA 


GTGTATCATA 

w a %jxxix vaia 


TGCCAAGTAC 

X UWW1AV3 X Aw> 


w w wwwwX AX X 


f!3U 


GACGTCAATG 


ACGGTAAATG 


GCCCGCCTGG 


CATTATGCCC 


AGTACATGAC 


500 


CTTATGGGAC 


TTTCCTACTT 


GGCAGTACAT 


CTACGTATTA 


GTCATCGCTA 


550 


TTACCATGGT 


GATGCGGTTT 


TGGCAGTACA 


TCAATGGGCG 


TGGATAGCGG 


600 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


650 


TTTGTTTTGG 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


700 


CCGCCCCATT 


GACGCAAATG 


GGCGGTAGGC 


GTGTACGGTG 


GGAGGTCTAT 


750 


ATAAGCAGAG 


CTCGTTTAGT 


GAACCGTCAG 


ATCGCCTGGA 


GACGCCATCC 


800 


ACGCTGTTTT 


GACCTCCATA 


GAAGACACCG 


GGACCGATCC 


AGCCTCCGGA 


850 


CTCTAGAGGA 


TCCGGTACTC 


GACCCGAGCT 


CGGATCCACT 


AGTAACGGCC 


900 


GCCAGTGTGC 


TGGAATTCTG 


CACTCCAGGC 


TGCCCGGGTT 


TGCATGCTGC 


950 


TGCTGCTGCT 


GCTGCTGGGC 


CTGAGGCTAC 


AGCTCTCCCT 


GGGCATCATC 


1000 



WO 96/39530 




PCT/US96/10245 



109 





tv oo tv o tv tv ooo 
AGGAGAACCC 


GGAUX XwXfcrl? 




O TA O OOO TV OO O 


1UDU 


f*f*tt*f* o o mo o o 
CCTGGGTGCC 


OOO TV TV O TV TV 

G C CAAG AAG C 


m/«^1k O 0* 0*Tjy0* 0+ 

TGCAGCCTGC 


ACAGACAGCC 


OOOT\ TV O TV TV OO 

GCCAAGAACC 


11UU 


XwAiwii L. XI 


f*f*rr\r* o o oo tv m 

CCTGGGCGAT 


0*0* r* tv tvfir*nr*f* 
GGGAX uuuiji? 


m/^ mom tv prrT 


OTA f^Tif^f^tT^f^f^r* 

bAwib UXbbb 


1J.OU 


it oo tv ftip^m^ iv 
AGGAXCCTAA 


TV TVOOOOTVOX "» 

AAGGG C AG AA 


OTVTVOOTAOTATA7V 

GAAGGACAAA 


f*irnr*r*r*r* r*r*ifif* 

CTGGGGuCTG 


tv 0 tv rpTV r*r*r*r*n) 
AGAXACCCC1 


XzUU 


0*0* 0*0* *\ moo tv o 
GGCGATGGAC 


0*0* on HII^Ami m 

CGCTTCCCAT 


ATGTGGCTCT 


r+rT\0*r**\ tv o TV otv 
GTCCZAAGAuA 


rnTV OTV TV rnomTV O 
XAUAAXGXAG 




TV O TV TV TV OTA mom 

ACAAAvJAX G X 


OOO TV O TV OTV om 

GU (J AG A CJ AG X 


OOTVOOOTAOTVO 

GGAGw wiUAb 


r+f*?\ Pr^PPTH 
LwltbbUw X A 


ooffiomo 00 00 
LblblbLbbb 


1JUU 


omoTv TA OOOOTV 

GTCAAGGGCA 


TV OIIMIIOOTV O X 

ACTTCCAGAC 


^ *A TT\TT\r* O tM III HO 

CATTGG CTTG 


tv 0 moo tv r*r*r* 

AGTGCAGCCG 


r* f* ^*f* f*n*fftrft tv ^ 
CGuGGX X XAA 


1JDU 


nnik omooTV iv f* 

CCAGTGCAAC 


TV OO Tk O Tfc ^^*t 

ACGACACGCG 


^•^.H. "Ik 0*0* Tk 0*0*111 

GCAACGAGGT 


CATCTCCGTG 


tv mo tv tv mf*f*f*r* 
ATGAATCGGG 


X4UU 


0m 0% * M % » m 0m a 

CCAAGAAaGC 


AGGGAAGTCA 


0+.tT\0+0+0+ *\ 0*^n0* 0* 

6TGGGAGTGG 


TAACCACCAC 


TV f*f+ H. OrflOOTV 0 

ACGAGTGCAG 




CACGCCTCGC 


CAGCCGGGAC 


0mrw*^ 0+0+ 0m 0m 0m m\ 0m 

CTACGCCCAC 


% 0*0* 0mrry0* *i n 0*0* 

ACGGTGAACC 


GCAACTGGTA 


1500 


CTCGGACGCC 


GACGTGCCTG 


^rnJt ^^ffn^^^V 0+\ ^+%^m*t ^m%^m 

CCTCGGCCCG 


0m0m^. 0m 0+ 0+0+0+0+ 

CCAGGAGGGG 


n*.0+0+i0+.*\ 0* 0* *x 0*j*x 

TGCCAGGACA 


1550 


TCGCTACGCA 


0+ «%m 0m +k #*i 0m 

GCTCATCTCC 


AACATGGACA 


TTGATGTGAT 


0+.0*jm*X 0*0 Ml \0+0* Tk 

CCTAGGTGGA 


4 0* 0\0\ 

1600 


GGCCGAAAGT 


ACATGTTTCG 


0m +\ m 0m 0m 0+ m\ ^ 0m0m 

CATGGGAACC 


0+0+^ 0+m\ 0+0+0+w+*0+ 

CCAGACCCTG 


« % 0+0*0* % m 

AGTACCCAGA 


•+ 0? tr f\ 

1650 


n*0* » Ti o iv o o 

TGACTACAGC 


CAAGGTGGGA 


0m 0% m\ 0m 0* 0*n*0+ 0* *x 

CCAGGCTGGA 


0m 0m 0* 0* *x *\ 0+ lk *\ m 

CGGGAAGAAT 


Mffto 0 1 no 0 tv 00 

CTGGTGCAGG 


1700 


AATGGCTCGG 


0m 0m « ^« «^ ^« ^m Tk m 

CGAACGCCAG 


0+0+m0+) 0+0*0+0* 4+t'W% 

GGTGCCCGGT 


ACGTGTGGAA 


m^^^ ^ tv omo TV 0 

CCGCACTGAG 


1750 


omo tv mo o tv oo 
CTCATGCAGG 


OIIUIIOO# \\W0*0* TV 

CTTCCCTGGA 


CCCGTCTGTG 


tv 0*.0*0**\ m^wnoTV 

ACCCATCTCA 


TGGGTCTCTT 


loUU 


|TIO TV f* r*f+TT\f* f* TV 

TGAGCCTGGA 


o tv o tv mo tv tv tv m 

GACATGAAAT 


TV f*f* TV ^HfnOAl 

ACGAGATCCA 


0*0*0* tv 0 tv omoo 

CCGAGACTCC 


tv otv omoo TV 00 
ACACXGGACCJ 


XoDU 


CC X CCCTGAT 


0*0+ *\ M Tk mo TV O TV 

GGAGATGACA 


0* *x 0* 0* 0*rt*0* 0*0* 0* 

GAGGCTGCCC 


TGCGCCTGCT 


OTVOOTVOTAOTAO 

GAGCAGACAy 


1 OAA 

xy uu 


t*f*%f*f*r* f*r* f* f*TT\ 
CCCCGCGGCT 


TCTTCCTCTT 


0*0*n*.0* 0* n. 0* 0* 0*m 

CGTGGAGGGT 


0* 0*n*.0*0* 0+.*\ rTy0*0* 

GGTCGCATCG 


tv 00 tv mo omo a 
ACCAXGGXCA 


lyou 


moTv iv Ti tv f+r+ 

TCATGAAAGC 


% *+ r* *+ + vii ii i ilk 

AGGGCTTACC 


0* 0* ^ ^m mi /hum lk 0+ 

GGGCACTGAC 


r**0* m 0* *x 0*}0+ *\ wv\0*. 

TGAGACGATC 


tv rno f TV 1 IOO TV OO 

ATGXXCGACG 


O r\AA 


TV r*f+ 0*0*. *\ f f If f IO Tk 

ACGCCATTGA 


M 0*0+0+0*0+0+0+0% 

GAGGGCGGGC 


m« 0* 0*nrt0* n 0+0+ *\ 

CAGCTCACCA 


0+ 0*0* *\ 0* 0* *\ 0* 0* *x 

GCGAGGAGGA 


0 tv 00 omo TV 00 
CACGCTGAGu 


O AK A 


CTCGTCACTG 


CCGACCACTC 


CCACGTCTTC 


TCCTTCGGAG 


GCTACCCCCT 


2100 


GCGAGGGAGC 


TCCTTCATCG 


GGCTGGCCGC 


TGGCAAGGCC 


CGGGACAGGA 


2150 


AGGCCTACAC 


GGTCCTCCTA 


TACGGAAACG 


GTCCAGGCTA 


TGTGCTCAAG 


2200 


GACGGCGCCC 


GGCCGGATGT 


TACCGAGAGC 


GAGAGCGGGA 


GCCCCGAGTA 


2250 


TCGGCAGCAG 


TCAGCAGTGC 


CCCTGGACGA 


AGAGACCCAC 


GCAGGCGAGG 


2300 
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ACGTGGCGGT GTTCGCGCGC GGCCCGCAGG 



GAGGAGuAGA 


LL1 IvAlAuL 


CP A CCTC ATG 


GCCCTACACC 


GCCTGCGALC 




CGCACCCGGG 


GCGGTCCGTG 




ACCCTGCTGC 


TGCTGGAGAC 


GGCCACTGCT 


GGGCTCCTGC 


TTCCCCATCC 


CGGAGTTCTC 


TCCTGCCTGG 


CCTCCAGCCC 


GAGTCGTCAT 


GGTCCTGCGA 


TGGAACCTTC 


CCCTCCCCGT 


ATGACACCAA 


ACCTGCCCCT 


TGGCTGCTuT 


GAGGGACTGC 


AGGTTGTGCC 


CTGTGGCTGC 


GGGCTCAGGC 


CATCGAGCGA 


CCACCTACAG 


TGGTCAGTCT 


GGAGGATGAC 


GTGGCGTGAA 


CATGATAAGA 


TACATTGATG 


AGTTTGGACA 


GAAAAAAATG 


CTTTATTTGT 


GAAATTTGTG 


n mm Tk mn 

ACCATTATAA 


GCTGCAATAA 


7k OTk 7k ^•mm Ik 7k 0 

ACAAGTTAAC 


m x m/ * n ii i im^ 7k ^ 

TATGTTTCAG 


GTTCAGGGGG 


AGGTG I GGGA 
















E\s\s\l*sis\s\Sis\s\ 








11 n n jn s% n li « 11 s% 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


CATGACTACG 


TCCGGCGTTC 


CATTTGGCAT 


CTCGGTTGTC 


TCGGCGCACT 


CCGTACAGTA 


GAGACAGAAA 


CCCGCGCTAC 


CATACTGGAG 


ATGTAACACT 


TTGACAATGC 


ACAACGTGAG 




PCT/US96/1024S 



CGCACCTGGT 


TCACGGCGTG 


2350 


GCCTTCGCCG 


CCTGCCTGGA 


2400 


CGCCGGCACC 


ACCGACGCCG 


2450 


TGCTTCCTCT 


GCTGGCCGGG 


2500 


CCOTGAGTGT 


CCCGTCCCTG 


2550 


OTGCTCCCCA 


CCTCCTGTCG 


2600 


VVvU\7V7AV7 X W 


CCTATACAGA 


2650 


GCGCTCTGGG 


GACTGAGCCC 


2700 


CGGACTCCCT 


ACCCCAACCC 


2750 


CTGCACCCCA 


GGAAAGGAGG 


2800 


CCCAGTGGGG 


TCGAGACAGA 


2850 


GCTGGCCGCG 


GGGATCGAGA 


2900 


A A PC AC A ACT 


AGAATGCAGT 


2950 


ATGCTATTGC 


TTTATTTGTA 


3000 


AACAACAATT 


G CATTCATTT 


3050 


GGTTTTTTCG 


GATCCTCTAG 


3100 


NNNNNNNNNN 


NNNNNNNNNN 


3150 


NNNNNNNNNN 


NNNNNNNNNN 


3200 


NNNNNNNNNN 


NNNNNNNNNN 


3250 


NNNNNNNNNN 


NNNNNNNNNN 


3300 


NNNNNNNNNN 


NNNNNNNNNN 


3350 


NNNNNNNNNN 


NNNGGATCCC 


3400 


GACACTACGA 


CCAACACGAT 


3450 


GGGATCGTCT 


ACCTCCTTTT 


3500 


GATCATCCGC 


TGCTGCCCGA 


3550 


TTACGTGCGA 


GGTCTTCCCT 


3600 
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GCAGTGTGGG 

UWIU X w X wUU 


AX X X Aww w X w 


ATTCAGGAAT 

AX X wAww'AAX 


www X X w X X WW 


CTGGGATATG 

W J. www £IX AX w 


3650 




VJ\jVjn(JV9AUU X 


X wX AAX WW X VJ 


AGGAAGTGTA 

AUUAAU X w X A 


TGCACGTGTG 

X w* wAww X w X w 


3700 


PPTPTG , T rr nGT 
UwlwiwA XwX 


vVWUlWli X w 


ATATCATGAC 

AXAX WAX w Aw 


GAGCATGATG 

VJAwvA X UA1 w 


ATCCATGGTT 

Al W WAX ww X X 


3750 


apgagtpptg 


uut XUX LUiiL 


XuXvAXXvjI X 


ppaptpppgp 

w wA\3 X LUUUUt 


TTPPPTfiPAC 

X X www X UUAu 


■SOW 




G wGGG waGG X 


^n^n^n^*^* ^^^^ tv 

X X XbbLLAbL 


TPPTTTA PP A 
Ibbl X X AGGA 


tv p rpp P TVlfZ A 
XbbXbbXubA 




X GGCGw wax G 


rpmrpA TV mnTi f TV 
X X XAAX CAGA 


r* r* ttt tv m a tr*r* 
GG X X X A XAX G 


G X Aw wGGGAG 


f*Pf*ffpf A ATT 
uXuuluAAl X 




AwAAwA 1 buL 


TV TV TV TV P TV/* TV 

AAAAGAGGTA 


ATGTTTATGT 


ppTi ^«^/*tptt 
CCAGCG X G 1 X 


TATP APPPPT 
XAX <j AG Vj V* w* X 




CGCGAOxXAA 


TCTACCTGCG 


CTTGTGGTAT 


GATGGCCACG 


IbbbXlLlbX 


Anno 

4UUU 




a hp^^ a p pn "imiip 

AXuAuCX XXu 


uA X ALAu w w* w 


PTTP P A PTP'I* 
wX X UwAwX W*X 


GGG A TTTTG A 
V7\JUA1 XXX UA 


*t V<J V/ 


A PA ATA TTPT 


phitg PTnT/i p 

Vjvj X uV# Ivl VJV* 


XuWiuX X AwX 


PTP ptp A flupT 
uluLXuAl X X 


AAGTGAGATC 

AA w X w A w A X w 


4100 


7a /ip/stgp^pt 

Aw*w-w* X w* UuL X 


fiPTfiTfipppn 


PAPP A PA APP 
uAbuAUAAuu 


ppppttatgp 
wOwwX XaX ww 


X w Wwww WWW X 




PPPA ATPATP 




pp a pt^pp a t 

wwAwX w*wwAX 


/*I|M|I/ •III A HP^P^^^^ 

ul luiAl X ww 


rn/;p a <2(Z A PGG 




AGwGGwGGwG 


GUAGwAGX X X 


IV ITWPPP PP PP P 

Ax XCGCGCGC 


TPPTPPA PPA 
X G w X VzUauLA 


pp a nnnnnn^ 

w UAL w o w w w X 




jv TPPTP tv TPP 
AX Ltl GAX Gw 


TV PP A TT A TP A 
AwGA X X A X GA 


wXwXAwwwww 


ATPTA PPP AT 


PPPPATPAPT 
wwwwAX wAwX 




AG X GwGGC CG 


CGGGGAX CuA 


TV /^TV m/^ TV rnTV TV 

GAGaTGATAA 


r* tv rpTV otv rprn/"*7A 
GATACATTGA 


TP A PTTT^m A 
X bAbX X X bbA 


/OCA 


P 71 71 71 PP A P 71 A 


UXAVjaAXuLA 


/■•TP A A A A A A A 
bibAAAAAAA 


HPP PHPT^HP A HP HP HP 
XGwX X XAX X X 


\a X uAAAX llw 


4AOO 

*« *« V/ v/ 


TtfZAT^PTA'PT 


bLX X XAX XXu 


lAALUAl XaX 


AAV w X urtAAX 


A A AP A AGTTA 

AAAwAAw*X XA 


4450 

*x "s *^ w 


71 pa 7ir*7i 7i pa a 


TTPPflTTPTiT 
X IbUAJ. XwiX 


TTT a TPTTTP 
X X XAXuX X X w 


A PPTTP A PPG 
A w* u* X X Uiuuu 


GG A GGTGTGG 

wwAww X w X ww 


4500 


uAubl X X X X X 


pppatpptpt 

UubAl ww XwX 


A P A PTPP A PP 
AtfAvs X Ww*Aw w 


TP p A PPP A TP 

X W* WAW* W* WA X w 


PA AGPTGTAG 

wAAw W X w X Aw 


4550 


a X aau X Aw-wA 


TPPPPPPTTA 
XGGwGGGX XA 


TjfPPTi HP HP A A PHP 
AXWiX XAAwX 


A PA APPA APP 
AuAAvjbAAtL 


PPTAGTPGATG 

w w X Aw X UAX \3 


4600 


GAGX X GGwwA 


wX till wX wX 




HPPP ptp A PHP P. 
X ww*wX wAwXV7 


A (zcznccxcxcxcci 

Aw w w wwVJw ww 


4650 


ACCAAAGGTC 


GCCCGACGCC 


CGGGCTTTGC 


CCGGGCGGCC 


TCAGTGAGCG 


4700 


AGCGAGCGC6 


CAGCTGGCGT 


AATAGCGAAG 


AGGCCCGCAC 


CGATCGCCCT 


4750 


TCCCAACAGT 


TGCGCAGCCT 


GAATGGCGAA 


TGGAANTTCC 


AGACGATTGA 


4800 


GCGTCAAAAT 


GTAGGTATTT 


CCATGAGCGT 


TTTTCCTGTT 


GCAATGGCTG 


4850 


GCGGTAATAT 


TGTTCTGGAT 


ATTACCAGCA 


AGGCCGATAG 


TTTGAGTTCT 


4900 
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TCTACTCAGG 


CAAGTGATGT 


TATTACTAAT 


CAAAGAAGTA 


TTGCGACAAC 


4950 


66TTAATTT6 


CGTGATGGAC 


AGACTCTTTT 


ACTCGGTGGC 


CTCACTGATT 


5000 


ATAAAAACAC 


TTCTCAGGAT 


TCTGGCGTAC 


CGTTCCTGTC 


TAAAATCCCT 


5050 


TTAATCGGCC 


TCCTGTTTAG 


CTCCCGCTCT 


GATTCTAACG 


AGGAAAGCAC 


5100 


GTTATACGTG 


CTCGTCAAAG 


CAACCATAGT 


ACGCGCCCTG 


TAGCGGCGCA 


5150 


TTAAGCGCGG 


CGGGTGTGGT 


GGTTACGCGC 


AGCGTGACCG 


CTACACTTGC 


5200 


CAGCGCCCTA 


GCGCCCGCTC 


CTTTCGCTTT 


CTTCCCTTCC 


TTTCTCGCCA 


5250 


CGTTCGCCGG 


CTTTCCCCGT 


CAAGCTCTAA 


ATCGGGGGCT 


CCCTTTAGGG 


5300 


TTCCGATTTA 


GTGCTTTACG 


GCACCTCGAC 


CCCAAAAAAC 


TTGATTAGGG 


5350 


TGATGGTTCA 


CGTAGTGGGC 


CATCGCCCTG 


ATAGACGGTT 


TTTCGCCCTT 

AAA VWVWA A 


5400 


TGACGTTGGA 


GTCCACGTTC 


TTTAATAGTG 

mm mm * At * mm A ^b 


GACTCTTGTT 


CCAAACTGGA 


5450 


ACAACACTCA 


ACCCTATCTC 

* m AA* mm mm 


GGTCTATTCT 


TTTGATTTAT 

a x a \jn x a Jim 


AAGGGATTTT 

XiTlWVJwri J. AAA 


5500 


GCCGATTTCG 


GCCTATTGGT 


TAAAAAATGA 


GCTGATTTAA 


CAAAAATTTA 


5550 

•rf W m+ W 


ACGCGAATTT 


TAACAAAATA 


TTAACGTTTA 

^b M ^b ^b~ ^b* A 


CAATTTAAAT 


ATTTGCTTAT 


5600 


ACAATCTTCC 


TGTTTTTGGG 


GCTTTTCTGA 


TTATCAACCG 


GGGTACATAT 

^» ^m^my mm 4m m Jfc ^b A A ^b 


5650 


GATTGACATG 


CTAGTTTTAC 


GATTACCGTT 


CATCGATTCT 


CTTGTTTGCT 


5700 


CCAGACTCTC 


AGGCAATGAC 


CTGATAGCCT 


TTGTAGAGAC 


CTCTCAAAAA 


5750 


TAGCTACCCT 


CTCCGGCATG 


AATTTATCAG 


CTAGAACGGT 


TGAATATCAT 


5800 


ATTGATGGTG 


ATTTGACTGT 


CTCCGGCCTT 


TCTCACCCGT 


TTGAATCTTT 


5850 


ACCTACACAT 


TACTCAGGCA 


TTGCATTTAA 


AATATATGAG 


GGTTCTAAAA 


5900 


ATTTTTATCC 


TTGCGTTGAA 


ATAAAGGCTT 


CTCCCGCAAA 


AGTATTACAG 


5950 


GGTCATAATG 


TTTTTGGTAC 


AACCGATTTA 


GCTTTATGCT 


CTGAGGCTTT 


6000 


ATTGCTTAAT 


TTTGCTAATT 


CTTTGCCTTG 


CCTGTATGAT 


TTATTGGATG 


6050 


TTGGAANTTC 


CTGATGCGGT 


ATTTTCTCCT 


TACGCATCTG 


TGCGGTATTT 


6100 


CACACCGCAT 


ATGGTGCACT 


CTCAGTACAA 


TCTGCTCTGA 


TGCCGCATAG 


6150 


TTAAGCCAGC 


CCCGACACCC 


GCCAACACCC 


GCTGACGCGC 


CCTGACGGGC 


6200 
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TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


GTCTCCGGGA 


6250 


GCTGCATGTG 


TCAGAGGTTT 


TCACCGTCAT 


CACCGAAACG 


CGCGAGACGA 


6300 


AA666CCTC6 


TGATACGCCT 


ATTTTTATAG 


GTTAATGTCA 


TGATAATAAT 


6350 


GGTTTCTTAG 


ACGTCAGGTG 


GCACTTTTCG 


GGGAAATGTG 


CGCGGAACCC 


6400 


CTATTTGTTT 


ATTTTTCTAA 


ATACATTCAA 


ATATGTATCC 


GCTCATGAGA 


6450 


CAATAACCCT 


GATAAATGCT 


TCAATAATAT 


TGAAAAAGGA 


AGAGTATGAG 


6500 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


CTTTTTTGCG 


GCATTTTGCC 


6550 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


AGATGCTGAA 


6600 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


6650 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


ATGATGAGCA 


6700 


CTTTTAAAGT 


TCTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


TGACGCCGGG 


6750 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


TCTCAGAATG 


ACTTGGTTGA 


6800 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


ACAGTAAGAG 


6850 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


GGCCAACTTA 


6900 


CTTCTGACAA 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


TTTTGCACAA 


6950 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


7000 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


AGCAATGGCA 


7050 


ACAACGTTGC 


GCAAACTATT 


AACTGGCGAA 


CTACTTACTC 


TAGCTTCCCG 


7100 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


GGACCACTTC 


7150 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


ATCTGGAGCC 


7200 


GGTGAGCGTG 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


7250 


GCCCTCCCGT 


ATCGTAGTTA 


TCTACACGAC 


GGGGAGTCAG 


GCAACTATGG 


7300 


ATGAACGAAA 


TAGACAGATC 


GCTGAGATAG 


GTGCCTCACT 


GATTAAGCAT 


7350 


TGGTAACTGT 


CAGACCAAGT 


TTACTCATAT 


ATACTTTAGA 


TTGATTTAAA 


7400 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


GAAGATCCTT 


TTTGATAATC 


7450 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


AGCGTCAGAC 


7500 
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CCCGTAGAAA 


AGATCAAAGG 


ATCTTCTTGA 


GATCCTTTTT 


TTCTGCGCGT 


7550 


AATCTGCTGC 


TTGCAAACAA 


AAAAACCACC 


GPT A PP AGPG 

wwX AwwAwww 


w X wU lXlull 


/ DUU 


TGCCGGATCA 


AGAGCTACCA 


Av XwXXXXXw 


PG 71 2V fZCW 71 71 P 

UUAAUU X AAw 


X bull X wAur w 


7/rcft 
/ ODU 


AGAGCGCAGA 


TACCAAATAC 


Ivl ww X X w X A 




71 ^TT 71 np PP 71 
AUl X nbbWWi 


/ /UU 


CCACTTCAAG 


AACTCTGTAG 


C ACCG C PT A P 

vAv ww w w lAv 


ATA PPTPGPT 

AX Aw w X wV7w X 


PTfiPTlATPP 
wX UwXAAX ww 


/ /DU 


TGTTACCAGT 


GGCTGCTGCC 


AGTGGPGATA 

AwX OV? UUAlA 


71 CTPCT^TPT 
AO X wVar lVJlUl 


lAULbbb lib 


/ oUU 


GACTCAAGAC 


GATAGTT A PP 

wnXAwX X«%ww 


wwA X AAUVltU 


WibLub X w^jrw* 


Cr^WC TV 7A C*Clf*C* 
w- w X bAALbbb 


/ODD 


GGGTTCGTGC 


ACAPAGPPPA 
a wA wA w w w wi 


f2pfT w T 1 PP2l rpr 




7V r*r*f* 7i TV flTI/"* ■» 

ACJCwAAUXGA 


^ ft f\ f\ 

7900 


GATACCTAPA 


GPGTfZ A CPT 2k 


rp/-« TV TV TV 7V pr 


CCACGCTTCC 


CGAAGGGAGA 


^ rt c 

7950 


AAGGCGGAPA 

Xlnww wwwXlwA, 




AAbLvjbWiGG 


GTCGGAAwAG 


GAGAGCGCAC 


A AAA 

8000 


GAGGGAGCTT 


CCAGGGGGAA 


APCPPTPPTa 


X 11 X lAlAbl 


ww lbl LbbbT 


o n c f\ 
8050 


TTCGCCACCT 


CTGACTTGAG 


CGTCGATTTT 


TGTGATGCTC 


GTCAGGGGGG 


8100 


CGGAGCCTAT 


GGAAAAACGC 


CAGCAACGCG 


GCCTTTTTAC 


GGTTCCTGGC 


8150 


CTTTTGCTGG 


CCTTTTGCTC 


ACATGTTCTT 


TCCTGCGTTA 


TCCCCTGATT 


8200 


CTGTGGATAA 


CCGTATTACC 


GCCTTTGAGT 


GAGCTGATAC 


CGCTCGCCGC 


8250 


AGCCGAACGA 


CCGAGCGCAG 


CGAGTCAGTG 


AGCGAGGAAG 


CGGAAGAGC 


8299 
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WHAT IS CLAIMED IS: 

1. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell ex 
vivo comprising the steps of providing a recombinant 
adeno-associated virus comprising: (a) the DNA of at 
least a portion of the genome of an adeno-associated 
virus which portion is capable of transducing a selected 
gene into a target cell in the absence of cell division; 
and (b) a selected gene operatively linked to regulatory 
sequences directing its expression , said gene flanked by 
the DNA of (a) and capable of expression in the target 
cell; infecting said target cells with said recombinant 
adeno-associated virus; contacting said infected cells 
with an agent which facilitates the conversion of said ss 
recombinant virus to its double stranded form, wherein 
said conversion occurs in said target cell, resulting in 
enhanced transduction of said recombinant virus into said 
target cell. 

2. The method according to claim 1 wherein said 
agent is a helper virus comprising a selected gene 
encoding a polypeptide which can enhance the conversion 
of said single-stranded recombinant virus to double- 
stranded recombinant virus or functional fragment 
thereof, said helper virus capable of transducing said 
selected gene into the target cell in the absence of cell 
division, 

3. The method according to claim 2 wherein said 
contacting step comprises co-infecting said target cell 
with said helper virus. 

4. The method according to claim 3 wherein said 
helper virus is an adenovirus. 
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5. The method according to claim 2 wherein said 
selected gene is the adenovirus E4 gene or a functional 
fragment thereof. 



6. The method according to claim 2 wherein said 
adenovirus contains two selected genes or functional 
fragments thereof , said genes being the adenovirus E4 
gene and El gene. 



7. The method according to claim 5 or 6 wherein 
said functional fragment of said E4 gene comprises the 
open reading frame 6 of E4. 

8. The method according to claim 1 wherein said 
recombinant adeno-associated virus further comprises a 
second selected gene operatively linked to regulatory 
sequences capable of directing expression of said second 
gene, said second gene capable of facilitating the 
conversion of said single stranded recombinant virus to 
its double stranded form upon expression, and capable of 
co-expression with said first selected gene in the target 
cell, whereby the second gene product is expressed in the 
target cell, wherein said conversion occurs in said 
target cell, resulting in enhanced transduction of said 
recombinant virus in said target cell. 

9. The method according to claim 8 wherein the 
regulatory sequences directing expression of said second 
gene comprise an inducible promoter, and wherein 
expression of said second gene occurs in the presence of 
an inducing agent. 
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10. A recombinant adeno-associated virus comprising 
(a) the DMA of at least a portion of the genome of an 
adeno-associated virus which portion is capable of 
transducing at least two selected genes or functional 
fragments thereof into a target cell in the absence of 
cell division; (b) a first selected gene operatively 
linked to regulatory sequences directing its expression, 
(c) a second selected gene operatively linked to 
regulatory sequences capable of directing expression of 
said second gene, said second gene capable of 
facilitating the conversion of said single stranded 
recombinant virus to its double stranded form upon 
expression, said first and said second genes flanked by 
the ONA of (a) , said first and second gene capable of co- 
expression in the target cell. 

11. The recombinant virus according to claim 10 
wherein said second gene is selected from the group 
consisting of an adenovirus E4 gene, ORF6 of E4 and a 
functional fragment thereof. 

12. The recombinant virus according to claim 10 
wherein said second gene is the adenovirus El gene or a 
functional fragment thereof. 

13. The recombinant virus according to claim 10 
wherein the regulatory sequences directing expression of > 
said second gene comprise an inducible promoter, and 
wherein expression of said second gene occurs in the 
presence of an inducing agent. 
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14. The recombinant virus according to claim 10 
further comprising (d) an additional selected gene 
operatively linked to regulatory sequences capable of 
directing expression of said gene, said additional gene 
and said second gene capable of jointly facilitating the 
conversion of said single stranded recombinant virus to 
its double stranded form upon expression of both said 
second and additional genes, said first, second and 
additional genes flanked by the DNA of (a) , and capable 
of co-expression in the target cell. 

15. The recombinant virus according to claim 14 
wherein said second gene is the adenovirus E4 gene or a 
functional fragment thereof and said additional gene is 
the adenovirus El gene or a functional fragment thereof. 

16. The recombinant virus acc ordi ng to claim 15 

wherein said functional fragment of E4 is the ORF6 
sequence. 

17. The recombinant virus according to claim 15 
wherein the regulatory sequences directing expression of 
said additional gene comprise an inducible promoter, and 
wherein expression of said additional gene occurs in the 
presence of an inducing agent. 

18. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell 
comprising the steps of: providing a recombinant adeno- 
associated virus of claim 10 through 16; infecting said 
target cells with said recombinant virus; and culturing 
said infected cells under conditions which enable 
expression of said selected genes in said target cell, 
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wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant 
virus in said target cell. 

19. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell 
comprising the steps of: providing a recombinant adeno- 
associated virus of claim 17; infecting said target cells 
with said recombinant virus; and contacting said target 
cell with an inducing agent which induces the expression 
of said second or additional gene in said target cell, 
wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant 
virus in said target cell. 

20. A pharmaceutical composition comprising a 

r f c S >mbin ? r ? t ^adeno-awo^ated virus _ and ^ an agent which _ 
facilitates the conversion of said ss recombinant virus 
to its double stranded form in a target cell. 

21. The composition according to claim 20 wherein 
said virus is selected from the group consisting of the 
recombinant viruses of claims 10-16. 

22. The composition according to claim 20 wherein 
said virus is the recombinant virus of claim 17 and 
further comprising an inducing agent which induces the 
expression of said second or additional gene in said 
recombinant virus. 

23. A mammalian cell transduced with the 
recombinant adeno-associated viruses of claims 10-17. 
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CONC. OF DEX foM) 
YIELD AT 24 HR PJ. (LFU/ml) 

ORF6 PROTEIN DETECTED AT 24HR P.L IN 27-18 CELLS (Abs.mm> 



FIG. 4A 
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